INFORMAL SEQUENCE LISTING 



SEQ ID NO: 1
SIZE: 11
PRT-HIV-Tat

Tyr-Gly-Arg-Lys-Lys-Arg-Arg-Gln-Arg-Arg-Arg

SEQ ID NO: 2 
SIZE: 11 

PRT HIV-Tat Variant 

Tyr-Ala-Arg-Lys-Ala-Arg-Arg-Gln-Ala-Arg-Arg 
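
The SIZE field of a peptide entry can be cross-checked against its residue string. The following is a minimal sketch and not part of the original listing; the helper names `parse_peptide` and `to_one_letter` are hypothetical, and the lookup table assumes only the 20 standard amino acids occur.

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def parse_peptide(text):
    """Split a hyphen-separated three-letter sequence into a residue list.

    Line breaks are treated as separators so wrapped entries parse too.
    """
    tokens = text.replace("\n", "-").split("-")
    return [t.strip() for t in tokens if t.strip()]


def to_one_letter(residues):
    """Convert a residue list to a one-letter string (hypothetical helper)."""
    return "".join(THREE_TO_ONE[r] for r in residues)


# SEQ ID NO: 2 as given in the listing.
seq2 = parse_peptide("Tyr-Ala-Arg-Lys-Ala-Arg-Arg-Gln-Ala-Arg-Arg")
print(len(seq2), to_one_letter(seq2))
```

The residue count printed by the sketch is the value to compare with the SIZE line of the entry.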

SEQ ID NO: 3 
SIZE: 11 

PRT-HIV-Tat Variant 

Tyr-Ala-Arg-Ala-Ala-Ala-Arg-Gln-Ala-Arg-Ala 

SEQ ID NO: 4 
SIZE: 11 

PRT-HIV-Tat Variant 

Tyr-Ala-Arg-Ala-Ala-Arg-Ala-Ala-Arg-Arg-Arg

SEQ ID NO: 5 
SIZE: 11 

PRT: HIV-Tat Variant 

Tyr-Ala-Arg-Ala-Ala-Arg-Ala-Ala-Arg-Arg-Ala

SEQ ID NO: 6 
SIZE: 11 

PRT-HIV-Tat Variant

Tyr-Ala-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg

SEQ ID NO: 7 
SIZE: 11 

PRT-HIV-Tat Variant 

Tyr-Ala-Ala-Ala-Ala-Arg-Arg-Arg-Arg-Arg-Arg

SEQ ID NO: 8 
SIZE: 11 

PRT-HIV-Tat Variant 

Ala-Gly-Arg-Lys-Lys-Arg-Arg-Gln-Arg-Arg-Arg 

SEQ ID NO: 9 
SIZE: 34 
PRT-HSV VP22 

Asp-Ala-Ala-Thr-Ala-Thr-Arg-Gly-Arg-Ser-Ala-Ala-Ser-Arg-Pro-Thr-Glu-Arg- 
Pro-Arg-Ala-Pro-Ala-Arg-Ser-Ala-Ser-Arg-Pro-Arg-Arg-Pro-Val-Glu 

SEQ ID NO: 10
SIZE: 16 

PRT-Antennapedia third Helix, 43-58, Penetratin-1

Arg-Gln-Ile-Lys-Ile-Trp-Phe-Gln-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys

SEQ ID NO: 11 
SIZE: 16 

PRT-Antennapedia third Helix, 58-43

Lys-Lys-Trp-Lys-Met-Arg-Arg-Asn-Gln-Phe-Trp-Ile-Lys-Ile-Gln-Arg
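
SEQ ID NO: 11 reads as SEQ ID NO: 10 written in the opposite direction. A minimal sketch of that check follows; it is illustrative only, and the helper names `residues` and `is_retro` are hypothetical.

```python
def residues(text):
    """Split a hyphen-separated three-letter sequence into a residue list."""
    return [t.strip() for t in text.split("-") if t.strip()]


def is_retro(forward, candidate):
    """Return True if `candidate` is `forward` read in reverse residue order."""
    return residues(forward)[::-1] == residues(candidate)


# Residue strings as given for SEQ ID NO: 10 and SEQ ID NO: 11.
SEQ10 = "Arg-Gln-Ile-Lys-Ile-Trp-Phe-Gln-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys"
SEQ11 = "Lys-Lys-Trp-Lys-Met-Arg-Arg-Asn-Gln-Phe-Trp-Ile-Lys-Ile-Gln-Arg"

print(is_retro(SEQ10, SEQ11))
```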

SEQ ID NO: 12 
SIZE: 16 

PRT-Antennapedia third Helix, 43-58, D-amino acids 

Arg-Gln-Ile-Lys-Ile-Trp-Phe-Gln-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys

SEQ ID NO: 13 
SIZE: 16 

PRT-Antennapedia third Helix, 43-58, Pro50

Arg-Gln-Ile-Lys-Ile-Trp-Phe-Pro-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys
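
Variant labels such as "Pro50" use the residue numbering 43-58 of the parent helix. The sketch below, which is illustrative and not part of the original listing, lists the positions at which a variant differs from SEQ ID NO: 10; the function name `substitutions` is hypothetical, and numbering the first residue as 43 is taken from the entry titles.

```python
def residues(text):
    """Split a hyphen-separated three-letter sequence into a residue list."""
    return [t.strip() for t in text.split("-") if t.strip()]


def substitutions(parent, variant, first_position):
    """List (position, parent residue, variant residue) for each difference.

    Both sequences must have the same length; positions are numbered
    starting at `first_position`.
    """
    p, v = residues(parent), residues(variant)
    if len(p) != len(v):
        raise ValueError("sequences differ in length")
    return [
        (first_position + i, a, b)
        for i, (a, b) in enumerate(zip(p, v))
        if a != b
    ]


SEQ10 = "Arg-Gln-Ile-Lys-Ile-Trp-Phe-Gln-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys"
SEQ13 = "Arg-Gln-Ile-Lys-Ile-Trp-Phe-Pro-Asn-Arg-Arg-Met-Lys-Trp-Lys-Lys"

print(substitutions(SEQ10, SEQ13, 43))
```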

SEQ ID NO: 14 
SIZE: 16 

PRT-Antennapedia third Helix, 43-58, 3-Pro

Arg-Gln-Pro-Lys-Ile-Trp-Phe-Pro-Asn-Arg-Arg-Lys-Pro-Trp-Lys-Lys 

SEQ ID NO: 15 
SIZE: 16

PRT-Antennapedia third Helix, 43-58, R52M/M54R

Arg-Gln-Ile-Lys-Ile-Trp-Phe-Gln-Asn-Met-Arg-Arg-Lys-Trp-Lys-Lys

SEQ ID NO: 16 
SIZE: 16 

PRT-Antennapedia third Helix, 43-58, 7-Arg 

Arg-Gln-Ile-Arg-Ile-Trp-Phe-Gln-Asn-Arg-Arg-Met-Arg-Trp-Arg-Arg

SEQ ID NO: 17 
SIZE: 16 

PRT-Antennapedia third Helix, 43-58, W/R 

Arg-Arg-Trp-Arg-Arg-Trp-Trp-Arg-Arg-Trp-Trp-Arg-Arg-Trp-Arg-Arg

SEQ ID NO: 18 
SIZE: 16 

PRT-Kaposi's FGF signal sequence 

Ala-Ala-Val-Ala-Leu-Leu-Pro-Ala-Val-Leu-Leu-Ala-Leu-Leu-Ala-Pro

SEQ ID NO: 19 
SIZE: 20 

PRT: amino terminal secretory signal of human IL-2 

Met-Tyr-Arg-Met-Gln-Leu-Leu-Ser-Cys-Ile-Ala-Leu-Ser-Leu-Ala-Leu-Val 
Asn-Ser 

SEQ ID NO: 20
SIZE: 20 

PRT-IL-2-4 signal sequence 

Met-Tyr-Arg-Met-Ala-Leu-Leu-Ser-Cys-Ile-Ala-Leu-Ser-Leu-Ala-Leu-Val-Thr-
Asn-Ser



SEQ ID NO: 21 
SIZE: 302

PRT-HSV VP22 sequence 



Met-Thr-Ser-Arg-Arg-Ser-Val-Lys-Ser-Gly-Lys-Arg-Glu-Val-Lys-Arg-Asp-Glu-
Tyr-Glu-Asp-Leu-Tyr-Tyr-Thr-Lys-Ser-Ser-Gly-Ile-Ala-Ser-Lys-Asp-Ser-Lys-
Lys-Asp-Thr-Ser-Arg-Arg-Gly-Ala-Leu-Gln-Thr-Arg-Ser-Arg-Gln-Arg-Gly-Glu-
Val-Arg-Phe-Val-Gln-Tyr-Asp-Glu-Ser-Asp-Tyr-Ala-Leu-Tyr-Gly-Gly-Ser-Ser-
Ser-Glu-Asp-Asp-Glu-His-Pro-Glu-Val-Lys-Arg-Thr-Arg-Arg-Lys-Val-Ser-Gly-
Ala-Val-Leu-Ser-Gly-Lys-Gly-Lys-Ala-Arg-Ala-Lys-Lys-Lys-Lys-Ala-Gly-Ser-
Gly-Gly-Ala-Gly-Arg-Thr-Lys-Thr-Thr-Ala-Lys-Arg-Ala-Lys-Arg-Thr-Gln-Arg-
Val-Ala-Thr-Lys-Ala-Lys-Ala-Ala-Lys-Ala-Ala-Glu-Thr-Thr-Arg-Gly-Arg-Lys-
Ser-Ala-Gln-Lys-Glu-Ser-Ala-Ala-Leu-Lys-Asp-Ala-Lys-Ala-Ser-Thr-Ala-Lys-
Thr-Arg-Ser-Lys-Thr-Lys-Ala-Gln-Gly-Leu-Ala-Arg-Lys-Leu-His-Phe-Ser-Thr-
Ala-Lys-Lys-Asn-Lys-Asp-Ala-Lys-Trp-Thr-Lys-Arg-Val-Ala-Gly-Phe-Asn-Lys-
Arg-Val-Phe-Cys-Ala-Ala-Val-Gly-Arg-Leu-Ala-Ala-Met-His-Ala-Arg-Met-Ala-
Ala-Val-Gln-Leu-Trp-Asp-Met-Ser-Arg-Lys-Arg-Thr-Asp-Glu-Asp-Leu-Asn-Glu-
Leu-Leu-Gly-Ile-Thr-Thr-Ile-Arg-Val-Thr-Val-Cys-Glu-Gly-Lys-Asn-Leu-Leu-
Gln-Arg-Ala-Asn-Glu-Leu-Val-Asn-Lys-Asp-Val-Val-Gln-Asp-Val-Asp-Ala-Ala-
Thr-Ala-Thr-Arg-Gly-Arg-Ser-Ala-Ala-Ser-Arg-Lys-Thr-Glu-Arg-Lys-Arg-Ala-
Lys-Ala-Arg-Ser-Ala-Ser-Arg-Lys-Arg-Arg-Lys-Val-Glu-Ser











SEQ ID NO: 22
SIZE: 23 

DNA-T7 RNAP promoter: 



TAATACGACTCACTATAGGGAGA 

SEQ ID NO: 23
SIZE 23

DNA-SP6 RNAP promoter:

ATTTAGGTGACACTATAGAAGAA 

SEQ ID NO: 24 
SIZE 23 

DNA-T3 RNAP promoter: 

AATTAACCCTCACTAAAGGGAGA 

SEQ ID NO: 25
SIZE 23 

DNA-K11 RNAP promoter: 

AATTAGGGCACACTATAGGGAGA 
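
The four RNA polymerase promoter entries (SEQ ID NO: 22 to 25) each state a size of 23. The sketch below, which is illustrative only, checks those lengths and counts position-by-position differences from the T7 promoter; the helper name `mismatches` is hypothetical, and the comparison assumes the four sequences are meant to be read in register.

```python
# Promoter sequences as given in SEQ ID NO: 22-25.
PROMOTERS = {
    "T7": "TAATACGACTCACTATAGGGAGA",
    "SP6": "ATTTAGGTGACACTATAGAAGAA",
    "T3": "AATTAACCCTCACTAAAGGGAGA",
    "K11": "AATTAGGGCACACTATAGGGAGA",
}


def mismatches(a, b):
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences differ in length")
    return sum(x != y for x, y in zip(a, b))


for name, seq in PROMOTERS.items():
    # Print name, length, and number of differences from the T7 promoter.
    print(name, len(seq), mismatches(PROMOTERS["T7"], seq))
```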

SEQ ID NO: 26 
SIZE 24 

PRT IL-4 signal sequence 

Met-Gly-Leu-Thr-Ser-Gln-Leu-Leu-Pro-Pro-Leu-Phe-Phe-Leu-Leu-Ala-Cys-Ala-
Gly-Asn-Phe-Val-His-Gly

SEQ ID NO: 27
SIZE 302
PRT HSV VP22



Met-Thr-Ser-Arg-Arg-Ser-Val-Lys-Ser-Gly-Pro-Arg-Glu-Val-Pro-Arg-Asp-Glu-
Tyr-Glu-Asp-Leu-Tyr-Tyr-Thr-Pro-Ser-Ser-Gly-Met-Ala-Ser-Pro-Asp-Ser-Pro-
Pro-Asp-Thr-Ser-Arg-Arg-Gly-Ala-Leu-Gln-Thr-Arg-Ser-Arg-Gln-Arg-Gly-Glu-
Val-Arg-Phe-Val-Gln-Tyr-Asp-Glu-Ser-Asp-Tyr-Ala-Leu-Tyr-Gly-Gly-Ser-Ser-
Ser-Glu-Asp-Asp-Glu-His-Pro-Glu-Val-Pro-Arg-Thr-Arg-Arg-Pro-Val-Ser-Gly-
Ala-Val-Leu-Ser-Gly-Pro-Gly-Pro-Ala-Arg-Ala-Pro-Pro-Pro-Pro-Ala-Gly-Ser-
Gly-Gly-Ala-Gly-Arg-Thr-Pro-Thr-Thr-Ala-Pro-Arg-Ala-Pro-Arg-Thr-Gln-Arg-
Val-Ala-Thr-Lys-Ala-Pro-Ala-Ala-Pro-Ala-Ala-Glu-Thr-Thr-Arg-Gly-Arg-Lys-
Ser-Ala-Gln-Pro-Glu-Ser-Ala-Ala-Leu-Pro-Asp-Ala-Pro-Ala-Ser-Thr-Ala-Pro-
Thr-Arg-Ser-Lys-Thr-Pro-Ala-Gln-Gly-Leu-Ala-Arg-Lys-Leu-His-Phe-Ser-Thr-
Ala-Pro-Pro-Asn-Pro-Asp-Ala-Pro-Trp-Thr-Pro-Arg-Val-Ala-Gly-Phe-Asn-Lys-
Arg-Val-Phe-Cys-Ala-Ala-Val-Gly-Arg-Leu-Ala-Ala-Met-His-Ala-Arg-Met-Ala-
Ala-Val-Gln-Leu-Trp-Asp-Met-Ser-Arg-Pro-Arg-Thr-Asp-Glu-Asp-Leu-Asn-Glu-
Leu-Leu-Gly-Ile-Thr-Thr-Ile-Arg-Val-Thr-Val-Cys-Glu-Gly-Lys-Asn-Leu-Leu-
Gln-Arg-Ala-Asn-Glu-Leu-Val-Asn-Pro-Asp-Val-Val-Gln-Asp-Val-Asp-Ala-Ala-
Thr-Ala-Thr-Arg-Gly-Arg-Ser-Ala-Ala-Ser-Arg-Pro-Thr-Glu-Arg-Pro-Arg-Ala-
Pro-Ala-Arg-Ser-Ala-Ser-Arg-Pro-Arg-Arg-Pro-Val-Glu-Gly











SEQ ID NO: 28 
SIZE 6 

PRT: Artificial 



Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 29
SIZE 7

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 30
SIZE 8

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 31
SIZE 9

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 32
SIZE 10

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 33
SIZE 11

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 34
SIZE 12

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 35
SIZE 13

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 36
SIZE 14

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 37
SIZE 15

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 38
SIZE 16

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 39
SIZE 17

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 40
SIZE 18

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 41
SIZE 19

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Gly-Cys

SEQ ID NO: 42
SIZE 20

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-
Gly-Cys

SEQ ID NO: 43
SIZE 21

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-
Arg-Gly-Cys

SEQ ID NO: 44
SIZE 22

PRT: Artificial

Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-Arg-
Arg-Arg-Gly-Cys
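
SEQ ID NO: 28 to 44 form a regular series: n arginine residues followed by Gly-Cys, with n running from 4 to 20. The following minimal sketch regenerates the series for cross-checking the SIZE values; it is illustrative only, and the function name `poly_arg_gly_cys` and the mapping of SEQ ID numbers to n are assumptions read off the entries above.

```python
def poly_arg_gly_cys(n_arg):
    """Build an (Arg)n-Gly-Cys peptide in hyphenated three-letter notation."""
    return "-".join(["Arg"] * n_arg + ["Gly", "Cys"])


# SEQ ID NO: 28 carries 4 Arg residues, SEQ ID NO: 44 carries 20.
SERIES = {seq_id: poly_arg_gly_cys(seq_id - 24) for seq_id in range(28, 45)}

for seq_id, peptide in SERIES.items():
    # Residue count is the number of hyphens plus one.
    print(seq_id, peptide.count("-") + 1, peptide)
```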

SEQ ID NO: 45 
SIZE 22 

PRT: Kaposi's FGF signal sequence - full length

Met-Ser-Gly-Asp-Gly-Thr-Ala-Ala-Val-Ala-Leu-Leu-Pro-Ala-Val-Leu-Leu-Ala-
Leu-Leu-Ala-Pro

SEQ ID NO: 46 
SIZE 10769 
DNA: R011

TGGCCATTGC ATACGTTGTA TCCATATCAT AATATGTACA TTTATATTGG CTCATGTCCA ACATTACCGC CATGTTGACA TTGATTATTG 90
ACTAGTTATT AATAGTAATC AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC CGCGTTACAT AACTTACGGT AAATGGCCCG 180
CCTGGCTGAC CGCCCAACGA CCCCCGCCCA TTGACGTCAA TAATGACGTA TGTTCCCATA GTAACGCCAA TAGGGACTTT CCATTGACGT 270
CAATGGGTGG AGTATTTACG GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG CCAAGTACGC CCCCTATTGA CGTCAATGAC 360
GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT 450
ACCATGGTGA TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG GGATTTCCAA GTCTCCACCC CATTGACGTC 540
AATGGGAGTT TGTTTTGGCA CCAAAATCAA CGGGACTTTC CAAAATGTCG TAACAACTCC GCCCCATTGA CGCAAATGGG CGGTAGGCGT 630
GTACGGTGGG AGGTCTATAT AAGCAGAGCT CGTTTAGTGA ACCGTCAGAT CGCCTGGAGA CGCCATCCAC GCTGTTTTGA CCTCCATAGA 720
AGACACCGGG ACCGATCCAG CCTCCGCGGC CGGGAACGGT GCATTGGAAC GCGGATTCCC CGTGCCAAGA GTGACGTAAG TACCGCCTAT 810
AGACTCTATA GGCACACCCC TTTGGCTCTT ATGCATGCTA TACTGTTTTT GGCTTGGGGC CTATACACCC CCGCTTCCTT ATGCTATAGG 900
TGATGGTATA GCTTAGCCTA TAGGTGTGGG TTATTGACCA TTATTGACCA CTCCAACGGT GGAGGGCAGT GTAGTCTGAG CAGTACTCGT 990
TGCTGCCGCG CGCGCCACCA GACATAATAG CTGACAGACT AACAGACTGT TCCTTTCCAT GGGTCTTTTC TGCAGTCACC GTCGTCGACG 1080
GTATCGATAA GCTTGATCCA CCGCGGTGGC GGCCTGGCAC GACAGGTTTC CCGACTGGAA AGCGGGCAGT GAGCGCAACG CAATTAATGT 1170
GAGTTAGCTC ACTCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCCGG CTCGTATGTT GTGTGGAATT GTGAGCGGAT AACAATTTCA 1260
CACAGGAAAC AGCTATGACC ATGATTACGC CAAGCTCCAA CGATTTAGGT GACACTATAG AAGAGAAGGA ATTAATACGA CTCACTATAG 1350
GGAGAGAGAG AGAATTACCC TCACTAAAGG GAGGAGAAGC TTGCATGCCT GCAGGTCGAC TCTAGAGGAT CCCCCGGGCT GCAGGAATTC 1440
CGCNNNNCCT CTCCCTCCCC CCCCCCTAAC GTTACTGGCC GAAGCCGCTT GGAATAAGGC CGGTGTGCGT TTGTCTATAT GTTATTTTCC 1530
ACCATATTGC CGTCTTTTGG CAATGTGAGG GCCCGGAAAC CTGGCCCTGT CTTCTTGACG AGCATTCCTA GGGGTCTTTC CCCTCTCGCC 1620
AAAGGAATGC AAGGTCTGTT GAATGTCGTG AAGGAAGCAG TTCCTCTGGA AGCTTCTTGA AGACAAACAA CGTCTGTAGC GACCCTTTGC 1710
AGGCAGCGGA ACCCCCCACC TGGCGACAGG TGCCTCTGCG GCCAAAAGCC ACGTGTATAA GATACACCTG CAAAGGCGGC ACAACCCCAG 1800
TGCCACGTTG TGAGTTGGAT AGTTGTGGAA AGAGTCAAAT GGCTCTCCTC AAGCGTATTC AACAAGGGGC TGAAGGATGC CCAGAAGGTA 1890
CCCCATTGTA TGGGATCTGA TCTGGGGCCT CGGTGCACAT GCTTTACATG TGTTTAGTCG AGGTTAAAAA AACGTCTAGG CCCCCCGAAC 1980
CACGGGGACG TGGTTTTCCT TTGAAAAACA CGATGATAAT ATGGCCACAA CCATGGACAC GATTAACATC GCTAAGAACG ACTTCTCTGA 2070
CATCGAACTG GCTGCTATCC CGTTCAACAC TCTGGCTGAC CATTACGGTG AGCGTTTAGC TCGCGAACAG TTGGCCCTTG AGCATGAGTC 2160
TTACGAGATG GGTGAAGCAC GCTTCCGCAA GATGTTTGAG CGTCAACTTA AAGCTGGTGA GGTTGCGGAT AACGCTGCCG CCAAGCCTCT 2250
CATCACTACC CTACTCCCTA AGATGATTGC ACGCATCAAC GACTGGTTTG AGGAAGTGAA AGCTAAGCGC GGCAAGCGCC CGACAGCCTT 2340
CCAGTTCCTG CAAGAAATCA AGCCGGAAGC CGTAGCGTAC ATCACCATTA AGACCACTCT GGCTTGCCTA ACCAGTGCTG ACAATACAAC 2430
CGTTCAGGCT GTAGCAAGCG CAATCGGTCG GGCCATTGAG GACGAGGCTC GCTTCGGTCG TATCCGTGAC CTTGAAGCTA AGCACTTCAA 2520
GAAAAACGTT GAGGAACAAC TCAACAAGCG CGTAGGGCAC GTCTACAAGA AAGCATTTAT GCAAGTTGTC GAGGCTGACA TGCTCTCTAA 2610
GGGTCTACTC GGTGGCGAGG CGTGGTCTTC GTGGCATAAG GAAGACTCTA TTCATGTAGG AGTACGCTGC ATCGAGATGC TCATTGAGTC 2700
AACCGGAATG GTTAGCTTAC ACCGCCAAAA TGCTGGCGTA GTAGGTCAAG ACTCTGAGAC TATCGAACTC GCACCTGAAT ACGCTGAGGC 2790
TATCGCAACC CGTGCAGGTG CGCTGGCTGG CATCTCTCCG ATGTTCCAAC CTTGCGTAGT TCCTCCTAAG CCGTGGACTG GCATTACTGG 2880
TGGTGGCTAT TGGGCTAACG GTCGTCGTCC TCTGGCGCTG GTGCGTACTC ACAGTAAGAA AGCACTGATG CGCTACGAAG ACGTTTACAT 2970
GCCTGAGGTG TACAAAGCGA TTAACATTGC GCAAAACACC GCATGGAAAA TCAACAAGAA AGTCCTAGCG GTCGCCAACG TAATCACCAA 3060
GTGGAAGCAT TGTCCGGTCG AGGACATCCC TGCGATTGAG CGTGAAGAAC TCCCGATGAA ACCGGAAGAC ATCGACATGA ATCCTGAGGC 3150
TCTCACCGCG TGGAAACGTG CTGCCGCTGC TGTGTACCGC AAGGACAAGG CTCGCAAGTC TCGCCGTATC AGCCTTGAGT TCATGCTTGA 3240
GCAAGCCAAT AAGTTTGCTA ACCATAAGGC CATCTGGTTC CCTTACAACA TGGACTGGCG CGGTCGTGTT TACGCTGTGT CAATGTTCAA 3330
CCCGCAAGGT AACGATATGA CCAAAGGACT GCTTACGCTG GCGAAAGGTA AACCAATCGG TAAGGAAGGT TACTACTGGC TGAAAATCCA 3420
CGGTGCAAAC TGTGCGGGTG TCGATAAGGT TCCGTTCCCT GAGCGCATCA AGTTCATTGA GGAAAACCAC GAGAACATCA TGGCTTGCGC 3510
TAAGTCTCCA CTGGAGAACA CTTGGTGGGC TGAGCAAGAT TCTCCGTTCT GCTTCCTTGC GTTCTGCTTT GAGTACGCTG GGGTACAGCA 3600
CCACGGCCTG AGCTATAACT GCTCCCTTCC GCTGGCGTTT GACGGGTCTT GCTCTGGCAT CCAGCACTTC TCCGCGATGC TCCGAGATGA 3690
GGTAGGTGGT CGCGCGGTTA ACTTGCTTCC TAGTGAAACC GTTCAGGACA TCTACGGGAT TGTTGCTAAG AAAGTCAACG AGATTCTACA 3780
AGCAGACGCA ATCAATGGGA CCGATAACGA AGTAGTTACC GTGACCGATG AGAACACTGG TGAAATCTCT GAGAAAGTCA AGCTGGGCAC 3870
TAAGGCACTG GCTGGTCAAT GGCTGGCTTA CGGTGTTACT CGCAGTGTGA CTAAGCGTTC AGTCATGACG CTGGCTTACG GGTCCAAAGA 3960
GTTCGGCTTC CGTCAACAAG TGCTGGAAGA TACCATTCAG CCAGCTATTG ATTCCGGCAA GGGTCTGATG TTCACTCAGC CGAATCAGGC 4050
TGCTGGATAC ATGGCTAAGC TGATTTGGGA ATCTGTGAGC GTGACGGTGG TAGCTGCGGT TGAAGCAATG AACTGGCTTA AGTCTGCTGC 4140
TAAGCTGCTG GCTGCTGAGG TCAAAGATAA GAAGACTGGA GAGATTCTTC GCAAGCGTTG CGCTGTGCAT TGGGTAACTC CTGATGGTTT 4230
CCCTGTGTGG CAGGAATACA AGAAGCCTAT TCAGACGCGC TTGAACCTGA TGTTCCTCGG TCAGTTCCGC TTACAGCCTA CCATTAACAC 4320
CAACAAAGAT AGCGAGATTG ATGCACACAA ACAGGAGTCT GGTATCGCTC CTAACTTTGT ACACAGCCAA GACGGTAGCC ACCTTCGTAA 4410
GACTGTAGTG TGGGCACACG AGAAGTACGG AATCGAATCT TTTGCACTGA TTCACGACTC CTTCGGTACC ATTCCGGCTG ACGCTGCGAA 4500
CCTGTTCAAA GCAGTGCGCG AAACTATGGT TGACACATAT GAGTCTTGTG ATGTACTGGC TGATTTCTAC GACCAGTTCG CTGACCAGTT 4590
GCACGAGTCT CAATTGGACA AAATGCCAGC ACTTCCGGCT AAAGGTAACT TGAACCTCCG TGACATCTTA GAGTCGGACT TCGCGTTCGC 4680
GTAACGCCAA ATCAATACGA CTCCGGATCT GAACTTGTTT ATTGCAGCTT ATAATGGTTA CAAATAAAGC AATAGCATCA CAAATTTCAC 4770
AAATAAAGCA TTTTTTTCAC TGCATTCTAG TTGTGGTTTG TCCAAACTCA TCAATGTATC TTATCATGTC TGGATCTGGT TACCACTAAA 4860
CCAGCCTCAA GAACACCCGA ATGGAGTCTC TAAGCTACAT AATACCAACT TACACTTTAC AAAATGTTGT CCCCCAAAAT GTAGCCATTC 4950
GTATCTGCTC CTAATAAAAA GAAAGTTTCT TCACATTCTA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AACCCCCCCC CCCCCCCCCT 5040
GCAGGAATTC GATCTGGCAC GACAGGTTTC CCGACTGGAA AGCGGGCAGT GAGCGCAACG CAATTAATGT GAGTTAGCTC ACTCATTAGG 5130
CACCCCAGGC TTTACACTTT ATGCTTCCGG CTCGTATGTT GTGTGGAATT GTGAGCGGAT AACAATTTCA CACAGGAAAC AGCTATGACC 5220
ATGATTACGC CAAGCTCCAA CGATTTAGGT GACACTATAG AAGAGAAGGA ATTAATACGA CTCACTATAG GGAGAGAGAG AGAATTACCC 5310
TCACTAAAGG GAGGAGAAGC TTGCATGCCT GCAGGTCGAT CGAGCATGCA TCTAGGGCGG CCAATTCGCC CCTCTCCCTC CCCCCCCCCT 5400
AACGTTACTG GCCGAAGCCG CTTGGAATAA GGCCGGTGTG TGTTTGTCTA TATGTGATTT TCCACCATAT TGCCGTCTTT TGGCAATGTG 5490
AGGGCCCGGA AACCTGGCCC TGTCTTCTTG ACGAGCATTC CTAGGGGTCT TTCCCCTCTC GCCAAAGGAA TGCAAGGTCT GTTGAATGTC 5580
GTGAAGGAAG CAGTTCCTCT GGAAGCTTCT TGAAGACAAA CAACGTCTGT AGCGACCCTT TGCAGGCAGC GGAACCCCCC ACCTGGCGAC 5670
AGGTGCCTCT GCGGCCAAAA GCCACGTGTA TAAGATACAC CTGCAAAGGC GGCACAACCC CAGTGCCACG TTGTGAGTTG GATAGTTGTG 5760
GAAAGAGTCA AATGGCTCTC CTCAAGCGTA TTCAACAAGG GGCTGAAGGA TGCCCAGAAG GTACCCCATT GTATGGGATC TGATCTGGGG 5850
CCTCGGTGCA CATGCTTTAC ATGTGTTTAG TCGAGGTTAA AAAACGTCTA GGCCCCCCGA ACCACGGGGA CGTGGTTTTC CTTTGAAAAA 5940
CACGATGATA ATATGGCCAC AACCATGGAA GACGCCAAAA ACATAAAGAA AGGCCCGGCG CCATTCTATC CGCTGGAAGA TGGAACCGCT 6030
GGAGAGCAAC TGCATAAGGC TATGAAGAGA TACGCCCTGG TTCCTGGAAC AATTGCTTTT ACAGATGCAC ATATCGAGGT GGACATCACT 6120
TACGCTGAGT ACTTCGAAAT GTCCGTTCGG TTGGCAGAAG CTATGAAACG ATATGGGCTG AATACAAATC ACAGAATCGT CGTATGCAGT 6210
GAAAACTCTC TTCAATTCTT TATGCCGGTG TTGGGCGCGT TATTTATCGG AGTTGCAGTT GCGCCCGCGA ACGACATTTA TAATGAACGT 6300
GAATTGCTCA ACAGTATGGG CATTTCGCAG CCTACCGTGG TGTTCGTTTC CAAAAAGGGG TTGCAAAAAA TTTTGAACGT GCAAAAAAAG 6390
CTCCCAATCA TCCAAAAAAT TATTATCATG GATTCTAAAA CGGATTACCA GGGATTTCAG TCGATGTACA CGTTCGTCAC ATCTCATCTA 6480
CCTCCCGGTT TTAATGAATA CGATTTTGTG CCAGAGTCCT TCGATAGGGA CAAGACAATT GCACTGATCA TGAACTCCTC TGGATCTACT 6570
GGTCTGCCTA AAGGTGTCGC TCTGCCTCAT AGAACTGCCT GCGTGAGATT CTCGCATGCC AGAGATCCTA TTTTTGGCAA TCAAATCATT 6660
CCGGATACTG CGATTTTAAG TGTTGTTCCA TTCCATCACG GTTTTGGAAT GTTTACTACA CTCGGATATT TGATATGTGG ATTTCGAGTC 6750
GTCTTAATGT ATAGATTTGA AGAAGAGCTG TTTTTACGAT CCCTTCAGGA TTACAAAATT CAAAGTGCGT TGCTAGTACC AACCCTATTT 6840
TCATTCTTCG CCAAAAGCAC TCTGATTGAC AAATACGATT TATCTAATTT ACACGAAATT GCTTCTGGGG GCGCACCTCT TTCGAAAGAA 6930
GTCGGGGAAG CGGTTGCAAA ACGCTTCCAT CTTCCAGGGA TACGACAAGG ATATGGGCTC ACTGAGACTA CATCAGCTAT TCTGATTACA 7020
CCCGAGGGGG ATGATAAACC GGGCGCGGTC GGTAAAGTTG TTCCATTTTT TGAAGCGAAG GTTGTGGATC TGGATACCGG GAAAACGCTG 7110
GGCGTTAATC AGAGAGGCGA ATTATGTGTC AGAGGACCTA TGATTATGTC CGGTTATGTA AACAATCCGG AAGCGACCAA CGCCTTGATT 7200
GACAAGGATG GATGGCTACA TTCTGGAGAC ATAGCTTACT GGGACGAAGA CGAACACTTC TTCATAGTTG ACCGCTTGAA GTCTTTAATT 7290
AAATACAAAG GATATCAGGT GGCCCCCGCT GAATTGGAAT CGATATTGTT ACAACACCCC AACATCTTCG ACGCGGGCGT GGCAGGTCTT 7380
CCCGACGATG ACGCCGGTGA ACTTCCCGCC GCCGTTGTTG TTTTGGAGCA CGGAAAGACG ATGACGGAAA AAGAGATCGT GGATTACGTC 7470
GCCAGTCAAG TAACAACCGC GAAAAAGTTG CGCGGAGGAG TTGTGTTTGT GGACGAAGTA CCGAAAGGTC TTACCGGAAA ACTCGACGCA 7560
AGAAAAATCA GAGAGATCCT CATAAAGGCC AAGAAGGGCG GAAAGTCCAA ATTGTAAAAT GTAACTGTAT TCAGCGATGA CGAAATTCTT 7650
AGCTATTGTA ATACTCTAGA GGATCTGGTT ACCACTAAAC CAGCCTCAAG AACACCCGAA TGGAGTCTCT AAGCTACATA ATACCAACTT 7740
ACACTTTACA AAATGTTGTC CCCCAAAATG TAGCCATTCG TATCTGCTCC TAATAAAAAG AAAGTTTCTT CACATTCTAA AAAAAAAAAA 7830
AAAAAAAAAA ACCCCCCCCC CCCCCCCCCC CCCCCCCCCC CTGCAGGTCG ACTCTAGAGG ATCTTTTTCC CTCGCCAAAA ATTATGGGGA 7920
CATCATGAAG CCCCTTGAGC ATCTGACTTC TGGCTAATAA AGGAAATTTA TTTCATTGCA ATAGTGTGTT GGAATTTTTT GTGTCTCTCA 8010
CTCGGAAGGA CATATGGGAG GGCAAATCAT TTAAAACATC AGAATCAGTA TTTGGTTTAG AGTTTGGCAA CATATGCCAT TCTTCCGCTT 8100
CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 8190
CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA 8280
GGCTCCGCCC CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC 8370
CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA AGCGTGGCGC 8460
TTTCTCAATG CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG 8550
ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA 8640
TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT 8730
GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT GGTTTTTTTG 8820
TTTGCAAGCA GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC TGACGCTCAG TGGAACGAAA 8910
ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT 9000
AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA 9090
TAGTTGCCTG ACTCCGGGGG GGGGGGGCGC TGAGGTCTGC CTCGTGAAGA AGGTGTTGCT GACTCATACC AGGCCTGAAT CGCCCCATCA 9180
TCCAGCCAGA AAGTGAGGGA GCCACGGTTG ATGAGAGCTT TGTTGTAGGT GGACCAGTTG GTGATTTTGA ACTTTTGCTT TGCCACGGAA 9270
CGGTCTGCGT TGTCGGGAAG ATGCGTGATC TGATCCTTCA ACTCAGCAAA AGTTCGATTT ATTCAACAAA GCCGCCGTCC CGTCAAGTCA 9360
GCGTAATGCT CTGCCAGTGT TACAACCAAT TAACCAATTC TGATTAGAAA AACTCATCGA GCATCAAATG AAACTGCAAT TTATTCATAT 9450
CAGGATTATC AATACCATAT TTTTGAAAAA GCCGTTTCTG TAATGAAGGA GAAAACTCAC CGAGGCAGTT CCATAGGATG GCAAGATCCT 9540
GGTATCGGTC TGCGATTCCG ACTCGTCCAA CATCAATACA ACCTATTAAT TTCCCCTCGT CAAAAATAAG GTTATCAAGT GAGAAATCAC 9630
CATGAGTGAC GACTGAATCC GGTGAGAATG GCAAAAGCTT ATGCATTTCT TTCCAGACTT GTTCAACAGG CCAGCCATTA CGCTCGTCAT 9720
CAAAATCACT CGCATCAACC AAACCGTTAT TCATTCGTGA TTGCGCCTGA GCGAGACGAA ATACGCGATC GCTGTTAAAA GGACAATTAC 9810
AAACAGGAAT CGAATGCAAC CGGCGCAGGA ACACTGCCAG CGCATCAACA ATATTTTCAC CTGAATCAGG ATATTCTTCT AATACCTGGA 9900
ATGCTGTTTT CCCGGGGATC GCAGTGGTGA GTAACCATGC ATCATCAGGA GTACGGATAA AATGCTTGAT GGTCGGAAGA GGCATAAATT 9990
CCGTCAGCCA GTTTAGTCTG ACCATCTCAT CTGTAACATC ATTGGCAACG CTACCTTTGC CATGTTTCAG AAACAACTCT GGCGCATCGG 10080
GCTTCCCATA CAATCGATAG ATTGTCGCAC CTGATTGCCC GACATTATCG CGAGCCCATT TATACCCATA TAAATCAGCA TCCATGTTGG 10170
AATTTAATCG CGGCCTCGAG CAAGACGTTT CCCGTTGAAT ATGGCTCATA ACACCCCTTG TATTACTGTT TATGTAAGCA GACAGTTTTA 10260
TTGTTCATGA TGATATATTT TTATCTTGTG CAATGTAACA TCAGAGATTT TGAGACACAA CGTGGCTTTC CCCCCCCCCC CATTATTGAA 10350
GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC 10440
GAAAAGTGCC ACCTGACGTC TAAGAAACCA TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCCTCGCG 10530
CGTTTCGGTG ATGACGGTGA AAACCTCTGA CACATGCAGC TCCCGGAGAC GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAGCAGACAA 10620
GCCCGTCAGG GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTATGCGGCA TCAGAGCAGA TTGTACTGAG AGTGCACCAT 10710
ATGCGGTGTG AAATACCGCA CAGATGCGTA AGGAGAAAAT ACCGCATCAG ATTGGCTAT 10769

SEQ ID NO: 47
SIZE 5742
DNA: L053

TGGCCATTGC ATACGTTGTA TCCATATCAT AATATGTACA TTTATATTGG CTCATGTCCA ACATTACCGC CATGTTGACA TTGATTATTG 90
ACTAGTTATT AATAGTAATC AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC CGCGTTACAT AACTTACGGT AAATGGCCCG 180
CCTGGCTGAC CGCCCAACGA CCCCCGCCCA TTGACGTCAA TAATGACGTA TGTTCCCATA GTAACGCCAA TAGGGACTTT CCATTGACGT 270
CAATGGGTGG AGTATTTACG GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG CCAAGTACGC CCCCTATTGA CGTCAATGAC 360
GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT 450
ACCATGGTGA TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG GGATTTCCAA GTCTCCACCC CATTGACGTC 540
AATGGGAGTT TGTTTTGGCA CCAAAATCAA CGGGACTTTC CAAAATGTCG TAACAACTCC GCCCCATTGA CGCAAATGGG CGGTAGGCGT 630
GTACGGTGGG AGGTCTATAT AAGCAGAGCT CGTTTAGTGA ACCGTCAGAT CGCCTGGAGA CGCCATCCAC GCTGTTTTGA CCTCCATAGA 720
AGACACCGGG ACCGATCCAG CCTCCGCGGC CGGGAACGGT GCATTGGAAC GCGGATTCCC CGTGCCAAGA GTGACGTAAG TACCGCCTAT 810
AGACTCTATA GGCACACCCC TTTGGCTCTT ATGCATGCTA TACTGTTTTT GGCTTGGGGC CTATACACCC CCGCTTCCTT ATGCTATAGG 900
TGATGGTATA GCTTAGCCTA TAGGTGTGGG TTATTGACCA TTATTGACCA CTCCAACGGT GGAGGGCAGT GTAGTCTGAG CAGTACTCGT 990
TGCTGCCGCG CGCGCCACCA GACATAATAG CTGACAGACT AACAGACTGT TCCTTTCCAT GGGTCTTTTC TGCAGTCACC GTCGTCGACA 1080
CGTGTGATCA GATGATCCTC TAGACCAGGC GCCTGGATCC GCTAGCAGGC CTAAGCTTGA TAGCTTGGCA TTCCGGTACT GTTGGTAAAG 1170



CCACCATGGA AGACGCCAAA AACATAAAGA AAGGCCCGGC GCCATTCTAT CCGCTGGAAG ATGGAACCGC TGGAGAGCAA CTGCATAAGG 1260
CTATGAAGAG ATACGCCCTG GTTCCTGGAA CAATTGCTTT TACAGATGCA CATATCGAGG TGGACATCAC TTACGCTGAG TACTTCGAAA 1350
TGTCCGTTCG GTTGGCAGAA GCTATGAAAC GATATGGGCT GAATACAAAT CACAGAATCG TCGTATGCAG TGAAAACTCT CTTCAATTCT 1440
TTATGCCGGT GTTGGGCGCG TTATTTATCG GAGTTGCAGT TGCGCCCGCG AACGACATTT ATAATGAACG TGAATTGCTC AACAGTATGG 1530
GCATTTCGCA GCCTACCGTG GTGTTCGTTT CCAAAAAGGG GTTGCAAAAA ATTTTGAACG TGCAAAAAAA GCTCCCAATC ATCCAAAAAA 1620
TTATTATCAT GGATTCTAAA ACGGATTACC AGGGATTTCA GTCGATGTAC ACGTTCGTCA CATCTCATCT ACCTCCCGGT TTTAATGAAT 1710
ACGATTTTGT GCCAGAGTCC TTCGATAGGG ACAAGACAAT TGCACTGATC ATGAACTCCT CTGGATCTAC TGGTCTGCCT AAAGGTGTCG 1800
CTCTGCCTCA TAGAACTGCC TGCGTGAGAT TCTCGCATGC CAGAGATCCT ATTTTTGGCA ATCAAATCAT TCCGGATACT GCGATTTTAA 1890
GTGTTGTTCC ATTCCATCAC GGTTTTGGAA TGTTTACTAC ACTCGGATAT TTGATATGTG GATTTCGAGT CGTCTTAATG TATAGATTTG 1980
AAGAAGAGCT GTTTCTGAGG AGCCTTCAGG ATTACAAGAT TCAAAGTGCG CTGCTGGTGC CAACCCTATT CTCCTTCTTC GCCAAAAGCA 2070
CTCTGATTGA CAAATACGAT TTATCTAATT TACACGAAAT TGCTTCTGGT GGCGCTCCCC TCTCTAAGGA AGTCGGGGAA GCGGTTGCCA 2160
AGAGGTTCCA TCTGCCAGGT ATCAGGCAAG GATATGGGCT CACTGAGACT ACATCAGCTA TTCTGATTAC ACCCGAGGGG GATGATAAAC 2250
CGGGCGCGGT CGGTAAAGTT GTTCCATTTT TTGAAGCGAA GGTTGTGGAT CTGGATACCG GGAAAACGCT GGGCGTTAAT CAAAGAGGCG 2340
AACTGTGTGT GAGAGGTCCT ATGATTATGT CCGGTTATGT AAACAATCCG GAAGCGACCA ACGCCTTGAT TGACAAGGAT GGATGGCTAC 2430
ATTCTGGAGA CATAGCTTAC TGGGACGAAG ACGAACACTT CTTCATCGTT GACCGCCTGA AGTCTCTGAT TAAGTACAAA GGCTATCAGG 2520
TGGCTCCCGC TGAATTGGAA TCCATCTTGC TCCAACACCC CAACATCTTC GACGCAGGTG TCGCAGGTCT TCCCGACGAT GACGCCGGTG 2610
AACTTCCCGC CGCCGTTGTT GTTTTGGAGC ACGGAAAGAC GATGACGGAA AAAGAGATCG TGGATTACGT CGCCAGTCAA GTAACAACCG 2700
CGAAAAAGTT GCGCGGAGGA GTTGTGTTTG TGGACGAAGT ACCGAAAGGT CTTACCGGAA [three blocks illegible in source] 2790
TCATAAAGGC CAAGAAGGGC GGAAAGATCG CCGTGTAATT CTAGATCGAA TTCCTGCAGC CCGGGGGATC CAGATCTTTT TCCCTCGCCA 2880
AAAATTATGG GGACATCATG AAGCCCCTTG AGCATCTGAC TTCTGGCTAA TAAAGGAAAT TTATTTCATT GCAATAGTGT GTTGGAATTT 2970
TTTGTGTCTC TCACTCGGAA GGACATATGG GAGGGCAAAT CATTTAAAAC ATCAGAATCA GTATTTGGTT TAGAGTTTGG CAACATATGC 3060
CATTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG 3150
TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 3240
GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA 3330
TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG 3420
GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC 3510
CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC 3600
ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA 3690
GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC 3780
GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT 3870
CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT 3960
TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA 4050
TTTCGTTCAT CCATAGTTGC CTGACTCCGG GGGGGGGGGG CGCTGAGGTC TGCCTCGTGA AGAAGGTGTT GCTGACTCAT ACCAGGCCTG 4140
AATCGCCCCA TCATCCAGCC AGAAAGTGAG GGAGCCACGG TTGATGAGAG CTTTGTTGTA GGTGGACCAG TTGGTGATTT TGAACTTTTG 4230
CTTTGCCACG GAACGGTCTG CGTTGTCGGG AAGATGCGTG ATCTGATCCT TCAACTCAGC AAAAGTTCGA TTTATTCAAC AAAGCCGCCG 4320
TCCCGTCAAG TCAGCGTAAT GCTCTGCCAG TGTTACAACC AATTAACCAA TTCTGATTAG AAAAACTCAT CGAGCATCAA ATGAAACTGC 4410
AATTTATTCA TATCAGGATT ATCAATACCA TATTTTTGAA AAAGCCGTTT CTGTAATGAA GGAGAAAACT CACCGAGGCA GTTCCATAGG 4500
ATGGCAAGAT CCTGGTATCG GTCTGCGATT CCGACTCGTC CAACATCAAT ACAACCTATT AATTTCCCCT CGTCAAAAAT AAGGTTATCA 4590
AGTGAGAAAT CACCATGAGT GACGACTGAA TCCGGTGAGA ATGGCAAAAG CTTATGCATT TCTTTCCAGA CTTGTTCAAC AGGCCAGCCA 4680
TTACGCTCGT CATCAAAATC ACTCGCATCA ACCAAACCGT TATTCATTCG TGATTGCGCC TGAGCGAGAC GAAATACGCG ATCGCTGTTA 4770
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AAAGGACAAT TACAAACAGG AATCGAATGC 
ACAATATTTT CACCTGAATC AGGATATTCT 
TCTAATACCT GGAATGCTGT TTTCCCGGGG 
GGAGTACGGA TAAAATGCTT GATGGTCGGA 
AGAGGCATAA ATTCCGTCAG CCAGTTTAGT 
ACGCTACCTT TGCCATGTTT CAGAAACAAC 
TCTGGCGCAT CGGGCTTCCC ATACAATCGA 
TCGCGAGCCC ATTTATACCC ATATAAATCA 
GCATCCATGT TGGAATTTAA TCGCGGCCTC 
ATAACACCCC TTGTATTACT GTTTATGTAA 
GCAGACAGTT TTATTGTTCA TGATGATATA 
TTTTGAGACA CAACGTGGCT TTCCCCCCCC 
CCCCATTATT GAAGCATTTA TCAGGGTTAT 
ATTTAGAAAA ATAAACAAAT AGGGGTTCCG 
CGCACATTTC CCCGAAAAGT GCCACCTGAC 
ACCTATAAAA ATAGGCGTAT CACGAGGCCC 
TTTCGTCCTC GCGCGTTTCG GTGATGACGG 
GACGGTCACA GCTTGTCTGT AAGCGGATGC 
CGGGAGCAGA CAAGCCCGTC AGGGCGCGTC 
TAACTATGCG GCATCAGAGC AGATTGTACT 
GAGAGTGCAC CATATGCGGT GTGAAATACC 
CAGATTGGCT AT 

SEQ ID NO: 48 
SIZE 5313 
DNA: L05 9 

TCGCGCGTTT CGGTGATGAC GGTGAAAACC 

CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA 

GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 

CGGCATCAGA GCAGATTGTA CTGAGAGTGC 

ACCATATGCG GTGTGAAATA CCGCACAGAT 

ATTCGCCATT CAGGCTGCGC AACTGTTGGG 

AAGGGCGATC GGTGCGGGCC TCTTCGCTAT 

CAAGGCGATT AAGTTGGGTA ACGCCAGGGT 

TTTCCCAGTC ACGACGTTGT AAAACGACGG 

CCTCTAGAGT CGACCTGCAG GGGGGGGGGG 

GGGGGGGGGG GGGGGGGGGT TTTTTTTTTT 

CTTTTTATTA GGAGCAGATA CGAATGGCTA

CATTTTGGGG GACAACATTT TGTAAAGTGT 

TTCGGGTGTT CTTGAGGCTG GTTTAGTGGT 

AACCAGATCC TCTAGAGTAT TACAATAGCT 

ATTTTACAAT TTGGACTTTC CGCCCTTCTT 



AACCGGCGCA GGAACACTGC CAGCGCATCA 
4860 

ATCGCAGTGG TGAGTAACCA TGCATCATCA 
4950 

CTGACCATCT CATCTGTAAC ATCATTGGCA 
5040 

TAGATTGTCG CACCTGATTG CCCGACATTA 
5130 

GAGCAAGACG TTTCCCGTTG AATATGGCTC 
5220 

TTTTTATCTT GTGCAATGTA ACATCAGAGA 
5310 

TGTCTCATGA GCGGATACAT ATTTGAATGT 
5400 

GTCTAAGAAA CCATTATTAT CATGACATTA 
5490 

TGAAAACCTC TGACACATGC AGCTCCCGGA 
5580 

AGCGGGTGTT GGCGGGTGTC GGGGCTGGCT 
5670 

GCACAGATGC GTAAGGAGAA AATACCGCAT 
5742 



TCTGACACAT GCAGCTCCCG GAGACGGTCA 
90 

TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
180 

GCGTAAGGAG AAAATACCGC ATCAGGCGCC 
270 

TACGCCAGCT GGCGAAAGGG GGATGTGCTG 
360 

CCAGTGAATT CGAGCTCGGT ACCCGGGGAT 
450 

TTTTTTTTTT TTAGAATGTG AAGAAACTTT 
540 

AAGTTGGTAT TATGTAGCTT AGAGACTCCA 
630 

AAGAATTTCG TCATCGCTGA ATACAGTTAC 
720
GGCCTTTATG AGGATCTCTC TGATTTTTCT
TACTTCGTCC ACAAACACAA CTCCTCCGCG 
CAACTTTTTC GCGGTTGTTA CTTGACTGGC 
CGTCTTTCCG TGCTCCAAAA CAACAACGGC 
GGCGGGAAGT TCACCGGCGT CATCGTCGGG 
GGGGTGTTGT AACAATATCG ATTCCAATTC 
AGCGGGGGCC ACCTGATATC CTTTGTATTT 
GAAGTGTTCG TCTTCGTCCC AGTAAGCTAT 
GTCTCCAGAA TGTAGCCATC CATCCTTGTC 
TACATAACCG GACATAATCA TAGGTCCTCT 
GACACATAAT TCGCCTCTCT GATTAACGCC 
CTTCGCTTCA AAAAATGGAA CAACTTTACC 
GACCGCGCCC GGTTTATCAT CCCCCTCGGG 
GAGCCCATAT CCTTGTCGTA TCCCTGGAAG
ATGGAAGCGT TTTGCAACCG CTTCCCCGAC 
AATTTCGTGT AAATTAGATA AATCGTATTT 
GTCAATCAGA GTGCTTTTGG CGAAGAATGA 
AATTTTGTAA TCCTGAAGGG ATCGTAAAAA 
CAGCTCTTCT TCAAATCTAT ACATTAAGAC 
TGTAGTAAAC ATTCCAAAAC CGTGATGGAA 
TGGAACAACA CTTAAAATCG CAGTATCCGG 
GGCATGCGAG AATCTCACGC AGGCAGTTCT 
ATGAGGCAGA GCGACACCTT TAGGCAGACC 
AATTGTCTTG TCCCTATCGA AGGACTCTGG 
CACAAAATCG TATTCATTAA AACCGGGAGG 
CTGAAATCCC TGGTAATCCG TTTTAGAATC 
CATGATAATA ATTTTTTGGA TGATTGGGAG 
CCCCTTTTTG GAAACGAACA CCACGGTAGG 
CTGCGAAATG CCCATACTGT TGAGCAATTC 
AACTGCAACT CCGATAAATA ACGCGCCCAA 
CACCGGCATA AAGAATTGAA GAGAGTTTTC 
CAGCCCATAT CGTTTCATAG CTTCTGCCAA
CCGAACGGAC ATTTCGAAGT ACTCAGCGTA 
AAAAGCAATT GTTCCAGGAA CCAGGGCGTA 
TCTCTTCATA GCCTTATGCA GTTGCTCTCC 
CGCCGGGCCT TTCTTTATGT TTTTGGCGTC 
TTCCATGGTT GTGGCCATAT TATCATCGTG 
TCGGGGGGCC TAGACGTTTT TTAACCTCGA 
CTAAACACAT GTAAAGCATG TGCACCGAGG 
CTTCTGGGCA TCCTTCAGCC CCTTGTTGAA 



TGCGTCGAGT TTTCCGGTAA GACCTTTCGG 
810 

GACGTAATCC ACGATCTCTT TTTCCGTCAT 
900 

AAGACCTGCC ACGCCCGCGT CGAAGATGTT 
990 

AATTAAAGAC TTCAAGCGGT CAACTATGAA 
1080 

AATCAAGGCG TTGGTCGCTT CCGGATTGTT 
1170 

CAGCGTTTTC CCGGTATCCA GATCCACAAC 
1260 

TGTAATCAGA ATAGCTGATG TAGTCTCAGT 
1350 

TTCTTTCGAA AGAGGTGCGC CCCCAGAAGC 
1440 

AAATAGGGTT GGTACTAGCA ACGCACTTTG 
1530 

GACTCGAAAT CCACATATCA AATATCCGAG 
1620 

AATGATTTGA TTGCCAAAAA TAGGATCTCT 
1710 

AGTAGATCCA GAGGAGTTCA TGATCAGTGC 
1800 

TAGATGAGAT GTGACGAACG TGTACATCGA 
1890 

CTTTTTTTGC ACGTTCAAAA TTTTTTGCAA 
1980 

ACGTTCATTA TAAATGTCGT TCGCGGGCGC 
2070 

ACTGCATACG ACGATTCTGT GATTTGTATT 
2160 

AGTGATGTCC ACCTCGATAT GTGCATCTGT 
2250 

AGCGGTTCCA TCTTCCAGCG GATAGAATGG 
2340 

TTTTTCAAAG GAAAACCACG TCCCCGTGGT 
2430 

CCCCAGATCA GATCCCATAC AATGGGGTAC 
2520
TACGCTTGAG GAGAGCCATT
GGGTTGTGCC GCCTTTGCAG 
TACACGTGGC TTTTGGCCGC 
AAGGGTCGCT ACAGACGTTG 
AGAAGCTTCC AGAGGAACTG 
GAGAGGGGAA AGACCCCTAG 
CAAGAAGACA GGGCCAGGTT 
AAATCACATA TAGACAAACA 
TTATTCCAAG CGGCTTCGGC 
CCGCCCTAGA TGCATGCTCG 
AGGCATGCAA GCTTCTCCTC 
TCGTATTAAT TCCTTCTCTT 
ACCTAAATCG TTGGAGCTTG 
ATCCGCTCAC AATTCCACAC 
CCGGAAGCAT AAAGTGTAAA
CGTTGCGCTC ACTGCCCGCT 
GAAACCTGTC GTGCCAGCTG 
GTATTGGGCG CTCTTCCGCT 
CTGACTCGCT GCGCTCGGTC 
TAATACGGTT ATCCACAGAA 
ACGCAGGAAA GAACATGTGA 
CGTTGCTGGC GTTTTTCCAT 
CCCCTGACGA GCATCACAAA 
TATAAAGATA CCAGGCGTTT 
GCTCCCTCGT GCGCTCTCCT 
TCCCTTCGGG AAGCGTGGCG 
GCTCACGCTG TAGGTATCTC 
ACGAACCCCC CGTTCAGCCC 
CCTTATCCGG TAACTATCGT 
CAGCAGCCAC TGGTAACAGG 
CGAGGTATGT AGGCGGTGCT 
GAAGGACAGT ATTTGGTATC 
TGAAGCCAGT TACCTTCGGA 
CTGGTAGCGG TGGTTTTTTT 
AGCAGATTAC GCGCAGAAAA
CTGACGCTCA GTGGAACGAA 
AAGGGATTTT GGTCATGAGA 
AATGAAGTTT TAAATCAATC 
ATGAGTAAAC TTGGTCTGAC 
TCTGTCTATT TCGTTCATCC 



TGACTCTTTC CACAACTATC 
GTGTATCTTA 2610

AGAGGCACCT GTCGCCAGGT
TTTGTCTTCA 2700

CTTCCTTCAC GACATTCAAC
GAATGCTCGT 2790

TCCGGGCCCT CACATTGCCA
CACACCGGCC 2880
CAGTAACGTT AGGGGGGGGG
ATCGACCTGC 2970

CCTTTAGTGA GGGTAATTCT
CTATAGTGTC 3060

GCGTAATCAT GGTCATAGCT
AACATACGAG 3150

GCCTGGGGTG CCTAATGAGT
TTCCAGTCGG 3240

CATTAATGAA TCGGCCAACG
TCCTCGCTCA 3330

GTTCGGCTGC GGCGAGCGGT
TCAGGGGATA 3420

GCAAAAGGCC AGCAAAAGGC
AGGCTCCGCC 3510

AATCGACGCT CAAGTCAGAG
CCCCCTGGAA 3600

GTTCCGACCC TGCCGCTTAC
CTTTCTCATA 3690

AGTTCGGTGT AGGTCGTTCG
GACCGCTGCG 3780
CTTGAGTCCA ACCCGGTAAG
ATTAGCAGAG 3870

ACAGAGTTCT TGAAGTGGTG
TGCGCTCTGC 3960

AAAAGAGTTG GTAGCTCTTG
GTTTGCAAGC 4050

AAAGGATCTC AAGAAGATCC
AACTCACGTT 4140

TTATCAAAAA GGATCTTCAC
TAAAGTATAT 4230

AGTTACCAAT GCTTAATCAG
ATAGTTGCCT 4320



CAACTCACAA CGTGGCACTG 

GGGGGGTTCC GCTGCCTGCA 

AGACCTTGCA TTCCTTTGGC 

AAAGACGGCA ATATGGTGGA 

GAGGGAGAGG GGCGAATTGG 

CTCTCTCTCC CTATAGTGAG 

GTTTCCTGTG TGAAATTGTT 

GAGCTAACTC ACATTAATTG 

CGCGGGGAGA GGCGGTTTGC 

ATCAGCTCAC TCAAAGGCGG 

CAGGAACCGT AAAAAGGCCG 

GTGGCGAAAC CCGACAGGAC 

CGGATACCTG TCCGCCTTTC 

CTCCAAGCTG GGCTGTGTGC 

ACACGACTTA TCGCCACTGG 

GCCTAACTAC GGCTACACTA 
ATCCGGCAAA CAAACCACCG 
TTTGATCTTT TCTACGGGGT 
CTAGATCCTT TTAAATTAAA 
TGAGGCACCT ATCTCAGCGA
GACTCCCCGT CGTGTAGATA ACTACGATAC
CAATGATACC GCGAGACCCA CGCTCACCGG
CTCCAGATTT ATCAGCAATA AACCAGCCAG 
CAACTTTATC CGCCTCCATC CAGTCTATTA 
ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT
CCATTGCTAC AGGCATCGTG GTGTCACGCT 
CGTCGTTTGG TATGGCTTCA TTCAGCTCCG 
CCCCCATGTT GTGCAAAAAA GCGGTTAGCT 
CCTTCGGTCC TCCGATCGTT GTCAGAAGTA 
TGGCAGCACT GCATAATTCT CTTACTGTCA 
TGCCATCCGT AAGATGCTTT TCTGTGACTG 
AGTGTATGCG GCGACCGAGT TGCTCTTGCC 
CGGCGTCAAT ACGGGATAAT ACCGCGCCAC 
GAAAACGTTC TTCGGGGCGA AAACTCTCAA 
GGATCTTACC GCTGTTGAGA TCCAGTTCGA 
CAGCATCTTT TACTTTCACC AGCGTTTCTG 
GGTGAGCAAA AACAGGAAGG CAAAATGCCG 
GTTGAATACT CATACTCTTC CTTTTTCAAT 
ATTATTGAAG CATTTATCAG GGTTATTGTC 
AGAAAAATAA ACAAATAGGG GTTCCGCGCA 
CATTTCCCCG AAAAGTGCCA CCTGACGTCT 
ATAAAAATAG GCGTATCACG AGGCCCTTTC 
GTC 



GGGAGGGCTT ACCATCTGGC CCCAGTGCTG 
4410 

CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG 
4500 

CGCCAGTTAA TAGTTTGCGC AACGTTGTTG 
4590 

GTTCCCAACG ATCAAGGCGA GTTACATGAT 
4680 

AGTTGGCCGC AGTGTTATCA CTCATGGTTA 
4770 

GTGAGTACTC AACCAAGTCA TTCTGAGAAT 
4860 

ATAGCAGAAC TTTAAAAGTG CTCATCATTG 
4950 

TGTAACCCAC TCGTGCACCC AACTGATCTT 
5040 

CAAAAAAGGG AATAAGGGCG ACACGGAAAT 
5130 

TCATGAGCGG ATACATATTT GAATGTATTT
5220 

AAGAAACCAT TATTATCATG ACATTAACCT
5310 
5313 



SEQ ID NO: 49 
SIZE: 7940
DNA: R023

TGGCCATTGC ATACGTTGTA TCCATATCAT 

ACATTACCGC CATGTTGACA TTGATTATTG 

ACTAGTTATT AATAGTAATC AATTACGGGG 

CGCGTTACAT AACTTACGGT AAATGGCCCG 

CCTGGCTGAC CGCCCAACGA CCCCCGCCCA 

GTAACGCCAA TAGGGACTTT CCATTGACGT 

CAATGGGTGG AGTATTTACG GTAAACTGCC 

CCAAGTACGC CCCCTATTGA CGTCAATGAC 

GGTAAATGGC CCGCCTGGCA TTATGCCCAG 

CAGTACATCT ACGTATTAGT CATCGCTATT 

ACCATGGTGA TGCGGTTTTG GCAGTACATC 

GGATTTCCAA GTCTCCACCC CATTGACGTC 

AATGGGAGTT TGTTTTGGCA CCAAAATCAA 

GCCCCATTGA CGCAAATGGG CGGTAGGCGT



AATATGTACA TTTATATTGG CTCATGTCCA 
90 

TCATTAGTTC ATAGCCCATA TATGGAGTTC 
180 

TTGACGTCAA TAATGACGTA TGTTCCCATA 
270 

CACTTGGCAG TACATCAAGT GTATCATATG 
360 

TACATGACCT TATGGGACTT TCCTACTTGG
450 

AATGGGCGTG GATAGCGGTT TGACTCACGG 
540 

CGGGACTTTC CAAAATGTCG TAACAACTCC 
630
GTACGGTGGG AGGTCTATAT
CGCCATCCAC GCTGTTTTGA 
AGACACCGGG ACCGATCCAG 
CGTGCCAAGA GTGACGTAAG 
AGACTCTATA GGCACACCCC 
CTATACACCC CCGCTTCCTT 
TGATGGTATA GCTTAGCCTA 
GGAGGGCAGT GTAGTCTGAG 
TGCTGCCGCG CGCGCCACCA 
GGGTCTTTTC TGCAGTCACC 
GTATCGATAA GCTTGATCCA 
AGCGGGCAGT GAGCGCAACG 
GAGTTAGCTC ACTCATTAGG 
GTGTGGAATT GTGAGCGGAT 
CACAGGAAAC AGCTATGACC 
AAGAGAAGGA ATTAATACGA 
GGAGAGAGAG AGAATTACCC 
TCTAGAGGAT CCCCCGGGCT 
CGCNNNNCCT CTCCCTCCCC 
CGGTGTGCGT TTGTCTATAT 
ACCATATTGC CGTCTTTTGG
AGCATTCCTA GGGGTCTTTC 
AAAGGAATGC AAGGTCTGTT 
AGACAAACAA CGTCTGTAGC 
AGGCAGCGGA ACCCCCCACC 
GATACACCTG CAAAGGCGGC 
TGCCACGTTG TGAGTTGGAT 
AACAAGGGGC TGAAGGATGC 
CCCCATTGTA TGGGATCTGA 
AGGTTAAAAA AACGTCTAGG 
CACGGGGACG TGGTTTTCCT 
GATTAACATC GCTAAGAACG 
CATCGAACTG GCTGCTATCC 
TCGCGAACAG TTGGCCCTTG 
TTACGAGATG GGTGAAGCAC 
GGTTGCGGAT AACGCTGCCG 
CATCACTACC CTACTCCCTA 
AGCTAAGCGC GGCAAGCGCC 
CCAGTTCCTG CAAGAAATCA 
GGCTTGCCTA ACCAGTGCTG 



AAGCAGAGCT CGTTTAGTGA 
CCTCCATAGA 720

CCTCCGCGGC CGGGAACGGT
TACCGCCTAT 810
TTTGGCTCTT ATGCATGCTA
ATGCTATAGG 900

TAGGTGTGGG TTATTGACCA
CAGTACTCGT 990
GACATAATAG CTGACAGACT
GTCGTCGACG 1080

CCGCGGTGGC GGCCTGGCAC
CAATTAATGT 1170

CACCCCAGGC TTTACACTTT
AACAATTTCA 1260

ATGATTACGC CAAGCTCCAA
CTCACTATAG 1350

TCACTAAAGG GAGGAGAAGC
GCAGGAATTC 1440

CCCCCCTAAC GTTACTGGCC
GTTATTTTCC 1530

CAATGTGAGG GCCCGGAAAC
CCCTCTCGCC 1620

GAATGTCGTG AAGGAAGCAG
GACCCTTTGC 1710
TGGCGACAGG TGCCTCTGCG
ACAACCCCAG 1800
AGTTGTGGAA AGAGTCAAAT
CCAGAAGGTA 1890

TCTGGGGCCT CGGTGCACAT
CCCCCCGAAC 1980
TTGAAAAACA CGATGATAAT
ACTTCTCTGA 2070

CGTTCAACAC TCTGGCTGAC
AGCATGAGTC 2160
GCTTCCGCAA GATGTTTGAG
CCAAGCCTCT 2250

AGATGATTGC ACGCATCAAC
CGACAGCCTT 2340

AGCCGGAAGC CGTAGCGTAC
ACAATACAAC 2430



ACCGTCAGAT CGCCTGGAGA 

GCATTGGAAC GCGGATTCCC 

TACTGTTTTT GGCTTGGGGC 

TTATTGACCA CTCCAACGGT 

AACAGACTGT TCCTTTCCAT 

GACAGGTTTC CCGACTGGAA 

ATGCTTCCGG CTCGTATGTT 

CGATTTAGGT GACACTATAG 

TTGCATGCCT GCAGGTCGAC 

GAAGCCGCTT GGAATAAGGC 

CTGGCCCTGT CTTCTTGACG 

TTCCTCTGGA AGCTTCTTGA 

GCCAAAAGCC ACGTGTATAA 

GGCTCTCCTC AAGCGTATTC 

GCTTTACATG TGTTTAGTCG 

ATGGCCACAA CCATGGACAC 

CATTACGGTG AGCGTTTAGC 

CGTCAACTTA AAGCTGGTGA 

GACTGGTTTG AGGAAGTGAA 

ATCACCATTA AGACCACTCT
CGTTCAGGCT GTAGCAAGCG
TATCCGTGAC CTTGAAGCTA 
GAAAAACGTT GAGGAACAAC 
GCAAGTTGTC GAGGCTGACA 
GGGTCTACTC GGTGGCGAGG 
AGTACGCTGC ATCGAGATGC 
AACCGGAATG GTTAGCTTAC 
TATCGAACTC GCACCTGAAT 
TATCGCAACC CGTGCAGGTG 
TCCTCCTAAG CCGTGGACTG 
TGGTGGCTAT TGGGCTAACG 
AGCACTGATG CGCTACGAAG 
GCCTGAGGTG TACAAAGCGA 
AGTCCTAGCG GTCGCCAACG 
GTGGAAGCAT TGTCCGGTCG 
ACCGGAAGAC ATCGACATGA
TCTCACCGCG TGGAAACGTG 
TCGCCGTATC AGCCTTGAGT 
GCAAGCCAAT AAGTTTGCTA 
CGGTCGTGTT TACGCTGTGT 
CCCGCAAGGT AACGATATGA 
TAAGGAAGGT TACTACTGGC 
CGGTGCAAAC TGTGCGGGTG 
GGAAAACCAC GAGAACATCA
TAAGTCTCCA CTGGAGAACA 
GTTCTGCTTT GAGTACGCTG 
CCACGGCCTG AGCTATAACT 
CCAGCACTTC TCCGCGATGC 
GGTAGGTGGT CGCGCGGTTA 
TGTTGCTAAG AAAGTCAACG 
AGCAGACGCA ATCAATGGGA 
TGAAATCTCT GAGAAAGTCA 
TAAGGCACTG GCTGGTCAAT 
AGTCATGACG CTGGCTTACG 
GTTCGGCTTC CGTCAACAAG 
GGGTCTGATG TTCACTCAGC 
TGCTGGATAC ATGGCTAAGC 
TGAAGCAATG AACTGGCTTA 
TAAGCTGCTG GCTGCTGAGG
CGCTGTGCAT TGGGTAACTC 



CAATCGGTCG GGCCATTGAG 
AGCACTTCAA 2520

TCAACAAGCG CGTAGGGCAC
TGCTCTCTAA 2610

CGTGGTCTTC GTGGCATAAG
TCATTGAGTC 2700

ACCGCCAAAA TGCTGGCGTA
ACGCTGAGGC 2790

CGCTGGCTGG CATCTCTCCG
GCATTACTGG 2880

GTCGTCGTCC TCTGGCGCTG
ACGTTTACAT 2970

TTAACATTGC GCAAAACACC
TAATCACCAA 3060

AGGACATCCC TGCGATTGAG
ATCCTGAGGC 3150

CTGCCGCTGC TGTGTACCGC
TCATGCTTGA 3240

ACCATAAGGC CATCTGGTTC
CAATGTTCAA 3330

CCAAAGGACT GCTTACGCTG
TGAAAATCCA 3420

TCGATAAGGT TCCGTTCCCT
TGGCTTGCGC 3510

CTTGGTGGGC TGAGCAAGAT
GGGTACAGCA 3600

GCTCCCTTCC GCTGGCGTTT
TCCGAGATGA 3690

ACTTGCTTCC TAGTGAAACC
AGATTCTACA 3780

CCGATAACGA AGTAGTTACC
AGCTGGGCAC 3870

GGCTGGCTTA CGGTGTTACT
GGTCCAAAGA 3960

TGCTGGAAGA TACCATTCAG
CGAATCAGGC 4050

TGATTTGGGA ATCTGTGAGC
AGTCTGCTGC 4140

TCAAAGATAA GAAGACTGGA
CTGATGGTTT 4230



GACGAGGCTC GCTTCGGTCG 

GTCTACAAGA AAGCATTTAT 

GAAGACTCTA TTCATGTAGG 

GTAGGTCAAG ACTCTGAGAC 

ATGTTCCAAC CTTGCGTAGT 

GTGCGTACTC ACAGTAAGAA 

GCATGGAAAA TCAACAAGAA

CGTGAAGAAC TCCCGATGAA 

AAGGACAAGG CTCGCAAGTC 

CCTTACAACA TGGACTGGCG 

GCGAAAGGTA AACCAATCGG 

GAGCGCATCA AGTTCATTGA 

TCTCCGTTCT GCTTCCTTGC 

GACGGGTCTT GCTCTGGCAT 
GTTCAGGACA TCTACGGGAT 

GTGACCGATG AGAACACTGG 
CGCAGTGTGA CTAAGCGTTC 
CCAGCTATTG ATTCCGGCAA 
GTGACGGTGG TAGCTGCGGT 
GAGATTCTTC GCAAGCGTTG
CCCTGTGTGG CAGGAATACA
TCAGTTCCGC TTACAGCCTA 
CAACAAAGAT AGCGAGATTG 
ACACAGCCAA GACGGTAGCC 
GACTGTAGTG TGGGCACACG 
CTTCGGTACC ATTCCGGCTG 
CCTGTTCAAA GCAGTGCGCG 
TGATTTCTAC GACCAGTTCG 
GCACGAGTCT CAATTGGACA 
TGACATCTTA GAGTCGGACT 
GTAACGCCAA ATCAATACGA 
CAAATAAAGC AATAGCATCA
AAATAAAGCA TTTTTTTCAC 
TTATCATGTC TGGATCTGGT 
CCAGCCTCAA GAACACCCGA 
AAAATGTTGT CCCCCAAAAT 
GTATCTGCTC CTAATAAAAA 
AACCCCCCCC CCCCCCCCCC 
CCTGCAGGTC GACTCTAGAG 
GCCCCTTGAG CATCTGACTT 
AAGGAAATTT ATTTCATTGC 
ACATATGGGA GGGCAAATCA 
CAGAATCAGT ATTTGGTTTA 
CTGACTCGCT GCGCTCGGTC 
GGCGAGCGGT ATCAGCTCAC 
ACGCAGGAAA GAACATGTGA 
AGCAAAAGGC CAGGAACCGT
CCCCTGACGA GCATCACAAA 
CAAGTCAGAG GTGGCGAAAC 
GCTCCCTCGT GCGCTCTCCT 
TGCCGCTTAC CGGATACCTG 
GCTCACGCTG TAGGTATCTC 
AGGTCGTTCG CTCCAAGCTG 
CCTTATCCGG TAACTATCGT 
ACCCGGTAAG ACACGACTTA 
CGAGGTATGT AGGCGGTGCT 
TGAAGTGGTG GCCTAACTAC 
TGAAGCCAGT TACCTTCGGA 
GTAGCTCTTG ATCCGGCAAA 
AGCAGATTAC GCGCAGAAAA 



AGAAGCCTAT TCAGACGCGC 
CCATTAACAC 4320

ATGCACACAA ACAGGAGTCT
ACCTTCGTAA 4410
AGAAGTACGG AATCGAATCT
ACGCTGCGAA 4500

AAACTATGGT TGACACATAT
CTGACCAGTT 4590

AAATGCCAGC ACTTCCGGCT
TCGCGTTCGC 4680

CTCCGGATCT GAACTTGTTT
CAAATTTCAC 4770

TGCATTCTAG TTGTGGTTTG
TACCACTAAA 4860

ATGGAGTCTC TAAGCTACAT
GTAGCCATTC 4950

GAAAGTTTCT TCACATTCTA
CCCCCCCCCC 5040

GATCTTTTTC CCTCGCCAAA
CTGGCTAATA 5130

AATAGTGTGT TGGAATTTTT
TTTAAAACAT 5220

GAGTTTGGCA ACATATGCCA
GTTCGGCTGC 5310

TCAAAGGCGG TAATACGGTT
GCAAAAGGCC 5400

AAAAAGGCCG CGTTGCTGGC
AATCGACGCT 5490

CCGACAGGAC TATAAAGATA
GTTCCGACCC 5580
TCCGCCTTTC TCCCTTCGGG
AGTTCGGTGT 5670
GGCTGTGTGC ACGAACCCCC
CTTGAGTCCA 5760

TCGCCACTGG CAGCAGCCAC
ACAGAGTTCT 5850

GGCTACACTA GAAGGACAGT
AAAAGAGTTG 5940

CAAACCACCG CTGGTAGCGG
AAAGGATCTC 6030



TTGAACCTGA TGTTCCTCGG 

GGTATCGCTC CTAACTTTGT 

TTTGCACTGA TTCACGACTC 

GAGTCTTGTG ATGTACTGGC 

AAAGGTAACT TGAACCTCCG 

ATTGCAGCTT ATAATGGTTA

TCCAAACTCA TCAATGTATC 

AATACCAACT TACACTTTAC 

AAAAAAAAAA AAAAAAAAAA 

AATTATGGGG ACATCATGAA 

TGTGTCTCTC ACTCGGAAGG 

TTCTTCCGCT TCCTCGCTCA 

ATCCACAGAA TCAGGGGATA 

GTTTTTCCAT AGGCTCCGCC 

CCAGGCGTTT CCCCCTGGAA 

AAGCGTGGCG CTTTCTCAAT 

CGTTCAGCCC GACCGCTGCG 

TGGTAACAGG ATTAGCAGAG 

ATTTGGTATC TGCGCTCTGC 

TGGTTTTTTT GTTTGCAAGC
AAGAAGATCC TTTGATCTTT
AAGGGATTTT GGTCATGAGA 
GGATCTTCAC CTAGATCCTT 
ATGAGTAAAC TTGGTCTGAC 
GCTTAATCAG TGAGGCACCT 
GACTCCGGGG GGGGGGGGCG 
CCTCGTGAAG AAGGTGTTGC 
AAAGTGAGGG AGCCACGGTT 
TTGTTGTAGG TGGACCAGTT 
TTGTCGGGAA GATGCGTGAT 
AACTCAGCAA AAGTTCGATT 
TCTGCCAGTG TTACAACCAA 
CTGATTAGAA AAACTCATCG 
CAATACCATA TTTTTGAAAA 
GTAATGAAGG AGAAAACTCA 
CTGCGATTCC GACTCGTCCA 
AACCTATTAA TTTCCCCTCG 
CGACTGAATC CGGTGAGAAT 
TATGCATTTC TTTCCAGACT 
TCGCATCAAC CAAACCGTTA 
ATTGCGCCTG AGCGAGACGA 
TCGAATGCAA CCGGCGCAGG 
GCGCATCAAC AATATTTTCA 
TCCCGGGGAT CGCAGTGGTG 
CATCATCAGG AGTACGGATA 
AGTTTAGTCT GACCATCTCA 
CATTGGCAAC GCTACCTTTG 
ACAATCGATA GATTGTCGCA 
CGACATTATC GCGAGCCCAT 
GCGGCCTCGA GCAAGACGTT 
TATGGCTCAT AACACCCCTT 
ATGATATATT TTTATCTTGT 
ATCAGAGATT TTGAGACACA 
AGGGTTATTG TCTCATGAGC 
TTGAATGTAT TTAGAAAAAT 
CACCTGACGT CTAAGAAACC 
TGACATTAAC CTATAAAAAT 
GATGACGGTG AAAACCTCTG 
CTCCCGGAGA CGGTCACAGC 
GGCGCGTCAG CGGGTGTTGG 



TCTACGGGGT CTGACGCTCA 
TTATCAAAAA 6120

TTAAATTAAA AATGAAGTTT
AGTTACCAAT 6210
ATCTCAGCGA TCTGTCTATT
CTGAGGTCTG 6300

TGACTCATAC CAGGCCTGAA
GATGAGAGCT 6390

GGTGATTTTG AACTTTTGCT
CTGATCCTTC 6480

TATTCAACAA AGCCGCCGTC
TTAACCAATT 6570
AGCATCAAAT GAAACTGCAA
AGCCGTTTCT 6660
CCGAGGCAGT TCCATAGGAT
ACATCAATAC 6750

TCAAAAATAA GGTTATCAAG
GGCAAAAGCT 6840

TGTTCAACAG GCCAGCCATT
TTCATTCGTG 6930

AATACGCGAT CGCTGTTAAA
AACACTGCCA 7020

CCTGAATCAG GATATTCTTC
AGTAACCATG 7110
AAATGCTTGA TGGTCGGAAG
TCTGTAACAT 7200

CCATGTTTCA GAAACAACTC
CCTGATTGCC 7290

TTATACCCAT ATAAATCAGC
TCCCGTTGAA 7380

GTATTACTGT TTATGTAAGC
GCAATGTAAC 7470

ACGTGGCTTT CCCCCCCCCC
GGATACATAT 7560
AAACAAATAG GGGTTCCGCG
ATTATTATCA 7650

AGGCGTATCA CGAGGCCCTT
ACACATGCAG 7740

TTGTCTGTAA GCGGATGCCG
CGGGTGTCGG 7830



GTGGAACGAA AACTCACGTT 

TAAATCAATC TAAAGTATAT 

TCGTTCATCC ATAGTTGCCT 

TCGCCCCATC ATCCAGCCAG 

TTGCCACGGA ACGGTCTGCG 

CCGTCAAGTC AGCGTAATGC 

TTTATTCATA TCAGGATTAT 

GGCAAGATCC TGGTATCGGT 

TGAGAAATCA CCATGAGTGA 

ACGCTCGTCA TCAAAATCAC 

AGGACAATTA CAAACAGGAA 

TAATACCTGG AATGCTGTTT 

AGGCATAAAT TCCGTCAGCC 

TGGCGCATCG GGCTTCCCAT 

ATCCATGTTG GAATTTAATC 

AGACAGTTTT ATTGTTCATG 

CCATTATTGA AGCATTTATC 

CACATTTCCC CGAAAAGTGC 

TCGTCCTCGC GCGTTTCGGT 

GGAGCAGACA AGCCCGTCAG
GGCTGGCTTA ACTATGCGGC ATCAGAGCAG ATTGTACTGA GAGTGCACCA TATGCGGTGT
GAAATACCGC ACAGATGCGT AAGGAGAAAA 7920

TACCGCATCA GATTGGCTAT 7940

SEQ ID NO: 50 
SIZE: 7802

DNA: R065: CMV-HIS-VP22-T7RNAP fusion protein

TGGCCATTGC ATACGTTGTA TCCATATCAT AATATGTACA TTTATATTGG CTCATGTCCA 
ACATTACCGC CATGTTGACA TTGATTATTG 90 

ACTAGTTATT AATAGTAATC AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC 
CGCGTTACAT AACTTACGGT AAATGGCCCG 180 

CCTGGCTGAC CGCCCAACGA CCCCCGCCCA TTGACGTCAA TAATGACGTA TGTTCCCATA 
GTAACGCCAA TAGGGACTTT CCATTGACGT 270

CAATGGGTGG AGTATTTACG GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG
CCAAGTACGC CCCCTATTGA CGTCAATGAC 360

GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT TCCTACTTGG
CAGTACATCT ACGTATTAGT CATCGCTATT 450

ACCATGGTGA TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG
GGATTTCCAA GTCTCCACCC CATTGACGTC 540

AATGGGAGTT TGTTTTGGCA CCAAAATCAA CGGGACTTTC CAAAATGTCG TAACAACTCC
GCCCCATTGA CGCAAATGGG CGGTAGGCGT 630

GTACGGTGGG AGGTCTATAT AAGCAGAGCT CGTTTAGTGA ACCGTCAGAT CGCCTGGAGA
CGCCATCCAC GCTGTTTTGA CCTCCATAGA 720

AGACACCGGG ACCGATCCAG CCTCCGCGGC CGGGAACGGT GCATTGGAAC GCGGATTCCC 
CGTGCCAAGA GTGACGTAAG TACCGCCTAT 810 

AGAGTCTATA GGCCCACCCC CTTGGCTTCT TATGCATGCT ATACTGTTTT TGGCTTGGGG 
TCTATACACC CCCGCTTCCT CATGTTATAG 900 

GTGATGGTAT AGCTTAGCCT ATAGGTGTGG GTTATTGACC ATTATTGACC ACTCCAACGG 
TGGAGGGCAG TGTAGTCTGA GCAGTACTCG 990

TTGCTGCCGC GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA
TGGGTCTTTT CTGCAGTCAC CGTCGTCGAC 1080

GGTATCGATA AGCTTGAATT CATGCACCAC CACCACCACC ACCACCACGA GCTTATTATG
ACCTCTCGCC GCTCCGTGAA GTCGGGTCCG 1170

CGGGAGGTTC CGCGCGATGA GTACGAGGAT CTGTACTACA CCCCGTCTTC AGGTATGGCG
AGTCCCGATA GTCCGCCTGA CACCTCCCGC 1260

CGTGGCGCCC TACAGACACG CTCGCGCCAG AGGGGCGAGG TCCGTTTCGT CCAGTACGAC
GAGTCGGATT ATGCCCTCTA CGGGGGCTCG 1350

TCTTCCGAAG ACGACGAACA CCCGGAGGTC CCCCGGACGC GGCGTCCCGT TTCCGGGGCG
GTTTTGTCCG GCCCGGGGCC TGCGCGGGCG 1440

CCTCCGCCAC CCGCTGGGTC CGGAGGGGCC GGACGCACAC CCACCACCGC CCCCCGGGCC
CCCCGAACCC AGCGGGTGGC GTCTAAGGCC 1530
CCCGCGGCCC CGGCGGCGGA GACCACCCGC
GCACTCCCAG ACGCCCCCGC GTCGACGGCG 
CCAACCCGAT CCAAGACACC CGCGCAGGGG 
CCCCCAAACC CCGACGCGCC ATGGACCCCC 
CGGGTGGCCG GCTTTAACAA GCGCGTCTTC 
CATGCCCGGA TGGCGGCTGT CCAGCTCTGG 
GACATGTCGC GTCCGCGCAC AGACGAAGAC 
CGCGTGACGG TCTGCGAGGG CAAAAACCTG 
CTTCAGCGCG CCAACGAGTT GGTGAATCCA 
GCGACTCGAG GGCGTTCTGC GGCGTCGCGC 
CCCACCGAGC GACCTCGAGC CCCAGCCCGC 
GCCACAACCA TGGACACGAT TAACATCGCT 
AAGAACGACT TCTCTGACAT CGAACTGGCT 
TACGGTGAGC GTTTAGCTCG CGAACAGTTG 
GCCCTTGAGC ATGAGTCTTA CGAGATGGGT 
CAACTTAAAG CTGGTGAGGT TGCGGATAAC 
GCTGCCGCCA AGCCTCTCAT CACTACCCTA 
TGGTTTGAGG AAGTGAAAGC TAAGCGCGGC 
AAGCGCCCGA CAGCCTTCCA GTTCCTGCAA 
ACCATTAAGA CCACTCTGGC TTGCCTAACC 
AGTGCTGACA ATACAACCGT TCAGGCTGTA 
GAGGCTCGCT TCGGTCGTAT CCGTGACCTT 
GAAGCTAAGC ACTTCAAGAA AAACGTTGAG 
TACAAGAAAG CATTTATGCA AGTTGTCGAG 
GCTGACATGC TCTCTAAGGG TCTACTCGGT 
GACTCTATTC ATGTAGGAGT ACGCTGCATC 
GAGATGCTCA TTGAGTCAAC CGGAATGGTT 
GGTCAAGACT CTGAGACTAT CGAACTCGCA 
CCTGAATACG CTGAGGCTAT CGCAACCCGT 
TTCCAACCTT GCGTAGTTCC TCCTAAGCCG 
TGGACTGGCA TTACTGGTGG TGGCTATTGG 
CGTACTCACA GTAAGAAAGC ACTGATGCGC 
TACGAAGACG TTTACATGCC TGAGGTGTAC 
TGGAAAATCA ACAAGAAAGT CCTAGCGGTC 
GCCAACGTAA TCACCAAGTG GAAGCATTGT 
GAAGAACTCC CGATGAAACC GGAAGACATC 
GACATGAATC CTGAGGCTCT CACCGCGTGG 
GACAAGGCTC GCAAGTCTCG CCGTATCAGC 
CTTGAGTTCA TGCTTGAGCA AGCCAATAAG 
TACAACATGG ACTGGCGCGG TCGTGTTTAC 



GGCAGGAAAT CGGCCCAGCC AGAATCCGCC 
1620 

CTGGCCAGAA AGCTGCACTT TAGCACCGCC 
1710 

TGCGCCGCGG TCGGGCGCCT GGCGGCCATG 
1800 

CTCAACGAAC TCCTTGGCAT CACCACCATC 
1890 

GACGTGGTGC AGGACGTCGA CGCGGCCACG 
1980 

TCCGCTTCTC GCCCCAGACG GCCCGTCGAG 
2070 

GCTATCCCGT TCAACACTCT GGCTGACCAT 
2160 

GAAGCACGCT TCCGCAAGAT GTTTGAGCGT 
2250 

CTCCCTAAGA TGATTGCACG CATCAACGAC 
2340 

GAAATCAAGC CGGAAGCCGT AGCGTACATC 
2430 

GCAAGCGCAA TCGGTCGGGC CATTGAGGAC 
2520 

GAACAACTCA ACAAGCGCGT AGGGCACGTC 
2610 

GGCGAGGCGT GGTCTTCGTG GCATAAGGAA 
2700 

AGCTTACACC GCCAAAATGC TGGCGTAGTA 
2790 

GCAGGTGCGC TGGCTGGCAT CTCTCCGATG 
2880 

GCTAACGGTC GTCGTCCTCT GGCGCTGGTG 
2970 

AAAGCGATTA ACATTGCGCA AAACACCGCA 
3060 

CCGGTCGAGG ACATCCCTGC GATTGAGCGT 
3150 

AAACGTGCTG CCGCTGCTGT GTACCGCAAG 
3240 

TTTGCTAACC ATAAGGCCAT CTGGTTCCCT 
3330
GCTGTGTCAA TGTTCAACCC

AAAGGTAAAC CAATCGGTAA 

TACTGGCTGA AAATCCACGG 

CGCATCAAGT TCATTGAGGA 

AACATCATGG CTTGCGCTAA 

CCGTTCTGCT TCCTTGCGTT 

TACGCTGGGG TACAGCACCA 

GGGTCTTGCT CTGGCATCCA 

GCGATGCTCC GAGATGAGGT 

CAGGACATCT ACGGGATTGT 

GTCAACGAGA TTCTACAAGC 

ACCGATGAGA ACACTGGTGA 

AAAGTCAAGC TGGGCACTAA 

AGTGTGACTA AGCGTTCAGT 

GCTTACGGGT CCAAAGAGTT 

GCTATTGATT CCGGCAAGGG 

ACTCAGCCGA ATCAGGCTGC 

ACGGTGGTAG CTGCGGTTGA 

TGGCTTAAGT CTGCTGCTAA 

ATTCTTCGCA AGCGTTGCGC 

GTAACTCCTG ATGGTTTCCC 

AACCTGATGT TCCTCGGTCA 

CAGCCTACCA TTAACACCAA 

ATCGCTCCTA ACTTTGTACA 

GGTAGCCACC TTCGTAAGAC 

GCACTGATTC ACGACTCCTT 

CCGGCTGACG CTGCGAACCT 

TCTTGTGATG TACTGGCTGA 

CAGTTCGCTG ACCAGTTGCA 

GGTAACTTGA ACCTCCGTGA 

TCGGACTTCG CGTTCGCGTA 

GCAGCTTATA ATGGTTACAA 

AGCATCACAA ATTTCACAAA

AAACTCATCA ATGTATCTTA 

ATCTGGTTAC ATCGAATTCA 

CAGATCTTTT TCCCTCGCCA 

GGACATCATG AAGCCCCTTG 

GCAATAGTGT GTTGGAATTT 

TCACTCGGAA GGACATATGG 

TAGAGTTTGG CAACATATGC 



GCAAGGTAAC GATATGACCA 
GGAAGGTTAC 3420

TGCAAACTGT GCGGGTGTCG
AAACCACGAG 3510

GTCTCCACTG GAGAACACTT
CTGCTTTGAG 3600

CGGCCTGAGC TATAACTGCT
GCACTTCTCC 3690

AGGTGGTCGC GCGGTTAACT
TGCTAAGAAA 3780

AGACGCAATC AATGGGACCG
AATCTCTGAG 3870

GGCACTGGCT GGTCAATGGC
CATGACGCTG 3960

CGGCTTCCGT CAACAAGTGC
TCTGATGTTC 4050

TGGATACATG GCTAAGCTGA
AGCAATGAAC 4140

GCTGCTGGCT GCTGAGGTCA
TGTGCATTGG 4230

TGTGTGGCAG GAATACAAGA
GTTCCGCTTA 4320

CAAAGATAGC GAGATTGATG
CAGCCAAGAC 4410
TGTAGTGTGG GCACACGAGA
CGGTACCATT 4500

GTTCAAAGCA GTGCGCGAAA
TTTCTACGAC 4590

CGAGTCTCAA TTGGACAAAA
CATCTTAGAG 4680

ACGCCAAATC AATACGACTC
ATAAAGCAAT 4770

TAAAGCATTT TTTTCACTGC
TCATGTCTGG 4860

CGTGGGCCCG GTACCGTATA
AAAATTATGG 4950

AGCATCTGAC TTCTGGCTAA
TTTGTGTCTC 5040

GAGGGCAAAT CATTTAAAAC
CATTCTTCCG 5130



AAGGACTGCT TACGCTGGCG 

ATAAGGTTCC GTTCCCTGAG 

GGTGGGCTGA GCAAGATTCT 

CCCTTCCGCT GGCGTTTGAC 

TGCTTCCTAG TGAAACCGTT 

ATAACGAAGT AGTTACCGTG 

TGGCTTACGG TGTTACTCGC 

TGGAAGATAC CATTCAGCCA 

TTTGGGAATC TGTGAGCGTG 

AAGATAAGAA GACTGGAGAG 

AGCCTATTCA GACGCGCTTG 

CACACAAACA GGAGTCTGGT 

AGTACGGAAT CGAATCTTTT 

CTATGGTTGA CACATATGAG 

TGCCAGCACT TCCGGCTAAA 

CGGATCTGAA CTTGTTTATT 

ATTCTAGTTG TGGTTTGTCC 

CTCTAGAGCG GCCGCGGATC 

TAAAGGAAAT TTATTTCATT 

ATCAGAATCA GTATTTGGTT
CTTCCTCGCT CACTGACTCG

ACTCAAAGGC GGTAATACGG 

AATCAGGGGA TAACGCAGGA 

GTAAAAAGGC CGCGTTGCTG 

ATAGGCTCCG CCCCCCTGAC 

ACCCGACAGG ACTATAAAGA 

TTCCCCCTGG AAGCTCCCTC 

TGTCCGCCTT TCTCCCTTCG 

CGCTTTCTCA ATGCTCACGC 

TGGGCTGTGT GCACGAACCC 

CCGACCGCTG CGCCTTATCC 

TATCGCCACT GGCAGCAGCC 

GGATTAGCAG AGCGAGGTAT

ACGGCTACAC TAGAAGGACA 

TCTGCGCTCT GCTGAAGCCA 

AACAAACCAC CGCTGGTAGC 

TTGTTTGCAA GCAGCAGATT 

TTTCTACGGG GTCTGACGCT 

AAAACTCACG TTAAGGGATT 

TTTTAAATTA AAAATGAAGT 

TCTAAAGTAT ATATGAGTAA 

CTATCTCAGC GATCTGTCTA 

CCATAGTTGC CTGACTCCGG 

GCTGACTCAT ACCAGGCCTG 

TCATCCAGCC AGAAAGTGAG 

TTGGTGATTT TGAACTTTTG 

GAACGGTCTG CGTTGTCGGG 

TTTATTCAAC AAAGCCGCCG 

TCAGCGTAAT GCTCTGCCAG 

CGAGCATCAA ATGAAACTGC 

TATCAGGATT ATCAATACCA 

CACCGAGGCA GTTCCATAGG 

CCTGGTATCG GTCTGCGATT 

CGTCAAAAAT AAGGTTATCA 

CACCATGAGT GACGACTGAA 

CTTGTTCAAC AGGCCAGCCA 

CATCAAAATC ACTCGCATCA 

GAAATACGCG ATCGCTGTTA 

TACAAACAGG AATCGAATGC 

CACCTGAATC AGGATATTCT 



CTGCGCTCGG TCGTTCGGCT 
TTATCCACAG 5220

AAGAACATGT GAGCAAAAGG
GCGTTTTTCC 5310
GAGCATCACA AAAATCGACG
TACCAGGCGT 5400

GTGCGCTCTC CTGTTCCGAC
GGAAGCGTGG 5490

TGTAGGTATC TCAGTTCGGT
CCCGTTCAGC 5580

GGTAACTATC GTCTTGAGTC
ACTGGTAACA 5670

GTAGGCGGTG CTACAGAGTT
GTATTTGGTA 5760

GTTACCTTCG GAAAAAGAGT
GGTGGTTTTT 5850
ACGCGCAGAA AAAAAGGATC
CAGTGGAACG 5940

TTGGTCATGA GATTATCAAA
TTTAAATCAA 6030

ACTTGGTCTG ACAGTTACCA
TTTCGTTCAT 6120

GGGGGGGGGG CGCTGAGGTC
AATCGCCCCA 6210
GGAGCCACGG TTGATGAGAG
CTTTGCCACG 6300

AAGATGCGTG ATCTGATCCT
TCCCGTCAAG 6390

TGTTACAACC AATTAACCAA
AATTTATTCA 6480

TATTTTTGAA AAAGCCGTTT
ATGGCAAGAT 6570

CCGACTCGTC CAACATCAAT
AGTGAGAAAT 6660
TCCGGTGAGA ATGGCAAAAG
TTACGCTCGT 6750

ACCAAACCGT TATTCATTCG
AAAGGACAAT 6840

AACCGGCGCA GGAACACTGC
TCTAATACCT 6930



GCGGCGAGCG GTATCAGCTC 
CCAGCAAAAG GCCAGGAACC 
CTCAAGTCAG AGGTGGCGAA 
CCTGCCGCTT ACCGGATACC 
GTAGGTCGTT CGCTCCAAGC 
CAACCCGGTA AGACACGACT 
CTTGAAGTGG TGGCCTAACT 
TGGTAGCTCT TGATCCGGCA 
TCAAGAAGAT CCTTTGATCT 
AAGGATCTTC ACCTAGATCC 
ATGCTTAATC AGTGAGGCAC 
TGCCTCGTGA AGAAGGTGTT 
CTTTGTTGTA GGTGGACCAG 
TCAACTCAGC AAAAGTTCGA 
TTCTGATTAG AAAAACTCAT 
CTGTAATGAA GGAGAAAACT 
ACAACCTATT AATTTCCCCT 
CTTATGCATT TCTTTCCAGA 
TGATTGCGCC TGAGCGAGAC 
CAGCGCATCA ACAATATTTT
GGAATGCTGT TTTCCCGGGG ATCGCAGTGG TGAGTAACCA TGCATCATCA GGAGTACGGA
TAAAATGCTT GATGGTCGGA AGAGGCATAA 7020

ATTCCGTCAG CCAGTTTAGT CTGACCATCT CATCTGTAAC ATCATTGGCA ACGCTACCTT
TGCCATGTTT CAGAAACAAC TCTGGCGCAT 7110

CGGGCTTCCC ATACAATCGA TAGATTGTCG CACCTGATTG CCCGACATTA TCGCGAGCCC
ATTTATACCC ATATAAATCA GCATCCATGT 7200

TGGAATTTAA TCGCGGCCTC GAGCAAGACG TTTCCCGTTG AATATGGCTC ATAACACCCC
TTGTATTACT GTTTATGTAA GCAGACAGTT 7290

TTATTGTTCA TGATGATATA TTTTTATCTT GTGCAATGTA ACATCAGAGA TTTTGAGACA
CAACGTGGCT TTCCCCCCCC CCCCATTATT 7380

GAAGCATTTA TCAGGGTTAT TGTCTCATGA GCGGATACAT ATTTGAATGT ATTTAGAAAA
ATAAACAAAT AGGGGTTCCG CGCACATTTC 7470

CCCGAAAAGT GCCACCTGAC GTCTAAGAAA CCATTATTAT CATGACATTA ACCTATAAAA
ATAGGCGTAT CACGAGGCCC TTTCGTCCTC 7560

GCGCGTTTCG GTGATGACGG TGAAAACCTC TGACACATGC AGCTCCCGGA GACGGTCACA
GCTTGTCTGT AAGCGGATGC CGGGAGCAGA 7650

CAAGCCCGTC AGGGCGCGTC AGCGGGTGTT GGCGGGTGTC GGGGCTGGCT TAACTATGCG
GCATCAGAGC AGATTGTACT GAGAGTGCAC 7740

CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGATTGGCT AT
7802 

SEQ ID NO: 51 
SIZE: 7002

DNA: R071: CMV-IL2 signal sequence + first 11 amino acids of IL-2 fused to T7 RNAP



TGGCCATTGC ATACGTTGTA TCCATATCAT AATATGTACA TTTATATTGG CTCATGTCCA
ACATTACCGC CATGTTGACA TTGATTATTG 90

ACTAGTTATT AATAGTAATC AATTACGGGG TCATTAGTTC ATAGCCCATA TATGGAGTTC
CGCGTTACAT AACTTACGGT AAATGGCCCG 180

CCTGGCTGAC CGCCCAACGA CCCCCGCCCA TTGACGTCAA TAATGACGTA TGTTCCCATA
GTAACGCCAA TAGGGACTTT CCATTGACGT 270

CAATGGGTGG AGTATTTACG GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG
CCAAGTACGC CCCCTATTGA CGTCAATGAC 360

GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT TCCTACTTGG
CAGTACATCT ACGTATTAGT CATCGCTATT 450

ACCATGGTGA TGCGGTTTTG GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG
GGATTTCCAA GTCTCCACCC CATTGACGTC 540

AATGGGAGTT TGTTTTGGCA CCAAAATCAA CGGGACTTTC CAAAATGTCG TAACAACTCC
GCCCCATTGA CGCAAATGGG CGGTAGGCGT 630

GTACGGTGGG AGGTCTATAT AAGCAGAGCT CGTTTAGTGA ACCGTCAGAT CGCCTGGAGA
CGCCATCCAC GCTGTTTTGA CCTCCATAGA 720










AGACACCGGG ACCGATCCAG 
CGTGCCAAGA GTGACGTAAG 
AGAGTCTATA GGCCCACCCC 
TCTATACACC CCCGCTTCCT 
GTGATGGTAT AGCTTAGCCT 
TGGAGGGCAG TGTAGTCTGA 
TTGCTGCCGC GCGCGCCACC 
TGGGTCTTTT CTGCAGTCAC 
CATGTACCGG ATGCAGCTGC 
TGCCCCCACC TCTTCTTCTA 
CCAGGCCACA ACCATGGGCC 
CCGCCAGCGC CGCCGCGGCA 
TAACATCGCT AAGAACGACT 
GGCTGACCAT TACGGTGAGC 
CGAACAGTTG GCCCTTGAGC 
GTTTGAGCGT CAACTTAAAG 
TGCGGATAAC GCTGCCGCCA 
CATCAACGAC TGGTTTGAGG 
TAAGCGCGGC AAGCGCCCGA 
AGCGTACATC ACCATTAAGA 
TTGCCTAACC AGTGCTGACA 
CATTGAGGAC GAGGCTCGCT 
CCGTGACCTT GAAGCTAAGC 
AGGGCACGTC TACAAGAAAG 
AGTTGTCGAG GCTGACATGC 
GCATAAGGAA GACTCTATTC 
ACGCTGCATC GAGATGCTCA 
TGGCGTAGTA GGTCAAGACT 
CGAACTCGCA CCTGAATACG 
CTCTCCGATG TTCCAACCTT 
TCCTAAGCCG TGGACTGGCA 
GGCGCTGGTG CGTACTCACA 
ACTGATGCGC TACGAAGACG 
AAACACCGCA TGGAAAATCA 
CCTAGCGGTC GCCAACGTAA 
GATTGAGCGT GAAGAACTCC 
GGAAGACATC GACATGAATC 
GTACCGCAAG GACAAGGCTC 
CCGTATCAGC CTTGAGTTCA 
CTGGTTCCCT TACAACATGG 



CCTCCGCGGC CGGGAACGGT 
TACCGCCTAT 810 
CTTGGCTTCT TATGCATGCT 
CATGTTATAG 900

ATAGGTGTGG GTTATTGACC
GCAGTACTCG 990
AGACATAATA GCTGACAGAC
CGTCGTCGAC 1080
TGTCTTGCAT CGCCCTGTCT
CCAAGAAGAC 1170

ACCACCACCA CCACCACGTG
TGGACACGAT 1260
TCTCTGACAT CGAACTGGCT
GTTTAGCTCG 1350

ATGAGTCTTA CGAGATGGGT
CTGGTGAGGT 1440

AGCCTCTCAT CACTACCCTA
AAGTGAAAGC 1530

CAGCCTTCCA GTTCCTGCAA
CCACTCTGGC 1620

ATACAACCGT TCAGGCTGTA
TCGGTCGTAT 1710
ACTTCAAGAA AAACGTTGAG
CATTTATGCA 1800
TCTCTAAGGG TCTACTCGGT
ATGTAGGAGT 1890
TTGAGTCAAC CGGAATGGTT
CTGAGACTAT 1980
CTGAGGCTAT CGCAACCCGT
GCGTAGTTCC 2070

TTACTGGTGG TGGCTATTGG
GTAAGAAAGC 2160
TTTACATGCC TGAGGTGTAC
ACAAGAAAGT 2250

TCACCAAGTG GAAGCATTGT
CGATGAAACC 2340

CTGAGGCTCT CACCGCGTGG
GCAAGTCTCG 2430

TGCTTGAGCA AGCCAATAAG
ACTGGCGCGG 2520



GCATTGGAAC GCGGATTCCC 
ATACTGTTTT TGGCTTGGGG 
ATTATTGACC ACTCCAACGG 
TAACAGACTG TTCCTTTCCA 
CTGGCCCTGG TGACCAACTC 
ATGTACGGCC GCAAGAAGCG 
GCTATCCCGT TCAACACTCT 
GAAGCACGCT TCCGCAAGAT 
CTCCCTAAGA TGATTGCACG 
GAAATCAAGC CGGAAGCCGT 
GCAAGCGCAA TCGGTCGGGC 
GAACAACTCA ACAAGCGCGT 
GGCGAGGCGT GGTCTTCGTG 
AGCTTACACC GCCAAAATGC 
GCAGGTGCGC TGGCTGGCAT 
GCTAACGGTC GTCGTCCTCT 
AAAGCGATTA ACATTGCGCA 
CCGGTCGAGG ACATCCCTGC 
AAACGTGCTG CCGCTGCTGT 
TTTGCTAACC ATAAGGCCAT
TCGTGTTTAC GCTGTGTCAA
TACGCTGGCG AAAGGTAAAC 
GGAAGGTTAC TACTGGCTGA 
GTTCCCTGAG CGCATCAAGT 
AAACCACGAG AACATCATGG 
GCAAGATTCT CCGTTCTGCT 
CTGCTTTGAG TACGCTGGGG 
GGCGTTTGAC GGGTCTTGCT 
GCACTTCTCC GCGATGCTCC 
TGAAACCGTT CAGGACATCT 
TGCTAAGAAA GTCAACGAGA 
AGTTACCGTG ACCGATGAGA 
AATCTCTGAG AAAGTCAAGC 
TGTTACTCGC AGTGTGACTA 
CATGACGCTG GCTTACGGGT 
CATTCAGCCA GCTATTGATT
TCTGATGTTC ACTCAGCCGA 
TGTGAGCGTG ACGGTGGTAG 
AGCAATGAAC TGGCTTAAGT 
GACTGGAGAG ATTCTTCGCA
TGTGCATTGG GTAACTCCTG 
GACGCGCTTG AACCTGATGT 
GTTCCGCTTA CAGCCTACCA 
GGAGTCTGGT ATCGCTCCTA 
CAGCCAAGAC GGTAGCCACC 
CGAATCTTTT GCACTGATTC 
CGGTACCATT CCGGCTGACG 
CACATATGAG TCTTGTGATG 
TTTCTACGAC CAGTTCGCTG 
TCCGGCTAAA GGTAACTTGA 
CATCTTAGAG TCGGACTTCG 
CTTGTTTATT GCAGCTTATA 
ATAAAGCAAT AGCATCACAA 
TGGTTTGTCC AAACTCATCA 
TCATGTCTGG ATCTGGTTAC 
GCCGCGGATC CAGATCTTTT 
AAAATTATGG GGACATCATG 
TTATTTCATT GCAATAGTGT 
TTTGTGTCTC TCACTCGGAA 
GTATTTGGTT TAGAGTTTGG 



TGTTCAACCC GCAAGGTAAC 
CAATCGGTAA 2 610 

AAATCCACGG TGCAAACTGT 
TCATTGAGGA 2700 
CTTGCGCTAA GTCTCCACTG 
TCCTTGCGTT 2 7 90 

TACAGCACCA CGGCCTGAGC 
CTGGCATCCA 2 8 80 

GAGATGAGGT AGGTGGTCGC 
ACGGGATTGT 2 97 0 

TTCTACAAGC AGACGCAATC 
ACACTGGTGA 3 060 

TGGGCACTAA GGCACTGGCT 
AGCGTTCAGT 315 0 

CCAAAGAGTT CGGCTTCCGT 
CCGGCAAGGG 324 0 

ATCAGGCTGC TGGATACATG 
CTGCGGTTGA 3 3 3 0 

CTGCTGCTAA GCTGCTGGCT 
AGCGTTGCGC 3420

ATGGTTTCCC TGTGTGGCAG 
TCCTCGGTCA 3510 
TTAACACCAA CAAAGATAGC
ACTTTGTACA 3600

TTCGTAAGAC TGTAGTGTGG 
ACGACTCCTT 3690

CTGCGAACCT GTTCAAAGCA 
TACTGGCTGA 3780

ACCAGTTGCA CGAGTCTCAA 
ACCTCCGTGA 3870

CGTTCGCGTA ACGCCAAATC 
ATGGTTACAA 3960

ATTTCACAAA TAAAGCATTT 
ATGTATCTTA 4050

ATCGAATTCA CGTGGGCCCG 
TCCCTCGCCA 4140

AAGCCCCTTG AGCATCTGAC 
GTTGGAATTT 4230

GGACATATGG GAGGGCAAAT 
CAACATATGC 4320



GATATGACCA AAGGACTGCT 
GCGGGTGTCG ATAAGGTTCC 
GAGAACACTT GGTGGGCTGA 
TATAACTGCT CCCTTCCGCT 
GCGGTTAACT TGCTTCCTAG 
AATGGGACCG ATAACGAAGT 
GGTCAATGGC TGGCTTACGG 
CAACAAGTGC TGGAAGATAC 
GCTAAGCTGA TTTGGGAATC 
GCTGAGGTCA AAGATAAGAA 
GAATACAAGA AGCCTATTCA 
GAGATTGATG CACACAAACA 
GCACACGAGA AGTACGGAAT 
GTGCGCGAAA CTATGGTTGA 
TTGGACAAAA TGCCAGCACT 
AATACGACTC CGGATCTGAA 
TTTTCACTGC ATTCTAGTTG 
GTACCGTATA CTCTAGAGCG 
TTCTGGCTAA TAAAGGAAAT 
CATTTAAAAC ATCAGAATCA 
CATTCTTCCG CTTCCTCGCT
GTATCAGCTC ACTCAAAGGC 
TTATCCACAG AATCAGGGGA 
GCCAGGAACC GTAAAAAGGC 
GCGTTTTTCC ATAGGCTCCG 
AGGTGGCGAA ACCCGACAGG 
TACCAGGCGT TTCCCCCTGG 
ACCGGATACC TGTCCGCCTT 
GGAAGCGTGG CGCTTTCTCA 
CGCTCCAAGC TGGGCTGTGT 
CCCGTTCAGC CCGACCGCTG 
AGACACGACT TATCGCCACT 
ACTGGTAACA GGATTAGCAG
TGGCCTAACT ACGGCTACAC 
GTATTTGGTA TCTGCGCTCT 
TGATCCGGCA AACAAACCAC 
GGTGGTTTTT TTGTTTGCAA 
CCTTTGATCT TTTCTACGGG 
CAGTGGAACG AAAACTCACG 
ACCTAGATCC TTTTAAATTA 
TTTAAATCAA TCTAAAGTAT 
AGTGAGGCAC CTATCTCAGC 
TTTCGTTCAT CCATAGTTGC 
AGAAGGTGTT GCTGACTCAT 
AATCGCCCCA TCATCCAGCC 
GGTGGACCAG TTGGTGATTT 
CTTTGCCACG GAACGGTCTG 
AAAAGTTCGA TTTATTCAAC 
TCCCGTCAAG TCAGCGTAAT 
AAAAACTCAT CGAGCATCAA 
AATTTATTCA TATCAGGATT 
GGAGAAAACT CACCGAGGCA 
ATGGCAAGAT CCTGGTATCG 
AATTTCCCCT CGTCAAAAAT 
AGTGAGAAAT CACCATGAGT 
TCTTTCCAGA CTTGTTCAAC 
TTACGCTCGT CATCAAAATC 
TGAGCGAGAC GAAATACGCG 
AAAGGACAAT TACAAACAGG 
ACAATATTTT CACCTGAATC 



CACTGACTCG CTGCGCTCGG 
GGTAATACGG 4410

TAACGCAGGA AAGAACATGT 
CGCGTTGCTG 4500 
CCCCCCTGAC GAGCATCACA 
ACTATAAAGA 4590

AAGCTCCCTC GTGCGCTCTC 
TCTCCCTTCG 4680

ATGCTCACGC TGTAGGTATC 
GCACGAACCC 4770

CGCCTTATCC GGTAACTATC 
GGCAGCAGCC 4860

AGCGAGGTAT GTAGGCGGTG 
TAGAAGGACA 4950

GCTGAAGCCA GTTACCTTCG 
CGCTGGTAGC 5040

GCAGCAGATT ACGCGCAGAA 
GTCTGACGCT 5130

TTAAGGGATT TTGGTCATGA 
AAAATGAAGT 5220

ATATGAGTAA ACTTGGTCTG 
GATCTGTCTA 5310

CTGACTCCGG GGGGGGGGGG 
ACCAGGCCTG 5400

AGAAAGTGAG GGAGCCACGG 
TGAACTTTTG 5490

CGTTGTCGGG AAGATGCGTG 
AAAGCCGCCG 5580

GCTCTGCCAG TGTTACAACC 
ATGAAACTGC 5670

ATCAATACCA TATTTTTGAA 
GTTCCATAGG 5760

GTCTGCGATT CCGACTCGTC 
AAGGTTATCA 5850

GACGACTGAA TCCGGTGAGA 
AGGCCAGCCA 5940

ACTCGCATCA ACCAAACCGT 
ATCGCTGTTA 6030

AATCGAATGC AACCGGCGCA 
AGGATATTCT 6120



TCGTTCGGCT GCGGCGAGCG 

GAGCAAAAGG CCAGCAAAAG 

AAAATCGACG CTCAAGTCAG 

CTGTTCCGAC CCTGCCGCTT 

TCAGTTCGGT GTAGGTCGTT 

GTCTTGAGTC CAACCCGGTA 

CTACAGAGTT CTTGAAGTGG 

GAAAAAGAGT TGGTAGCTCT 

AAAAAGGATC TCAAGAAGAT

GATTATCAAA AAGGATCTTC 

ACAGTTACCA ATGCTTAATC 

CGCTGAGGTC TGCCTCGTGA 

TTGATGAGAG CTTTGTTGTA 

ATCTGATCCT TCAACTCAGC 

AATTAACCAA TTCTGATTAG 

AAAGCCGTTT CTGTAATGAA 

CAACATCAAT ACAACCTATT 

ATGGCAAAAG CTTATGCATT 

TATTCATTCG TGATTGCGCC 
GGAACACTGC CAGCGCATCA
TCTAATACCT GGAATGCTGT

GGAGTACGGA TAAAATGCTT 

AGAGGCATAA ATTCCGTCAG 

ACGCTACCTT TGCCATGTTT 

TCTGGCGCAT CGGGCTTCCC 

TCGCGAGCCC ATTTATACCC 

GCATCCATGT TGGAATTTAA 

ATAACACCCC TTGTATTACT 

GCAGACAGTT TTATTGTTCA 

TTTTGAGACA CAACGTGGCT 

CCCCATTATT GAAGCATTTA 

ATTTAGAAAA ATAAACAAAT 
CGCACATTTC CCCGAAAAGT 

ACCTATAAAA ATAGGCGTAT 
TTTCGTCCTC GCGCGTTTCG 
GACGGTCACA GCTTGTCTGT 
CGGGAGCAGA CAAGCCCGTC 
TAACTATGCG GCATCAGAGC 
GAGAGTGCAC CATATGCGGT 
CAGATTGGCT AT 



TTTCCCGGGG ATCGCAGTGG 
GATGGTCGGA 6210

CCAGTTTAGT CTGACCATCT 
CAGAAACAAC 6300

ATACAATCGA TAGATTGTCG 
ATATAAATCA 6390

TCGCGGCCTC GAGCAAGACG 
GTTTATGTAA 6480

TGATGATATA TTTTTATCTT 
TTCCCCCCCC 6570

TCAGGGTTAT TGTCTCATGA 
AGGGGTTCCG 6660 
GCCACCTGAC GTCTAAGAAA 
CACGAGGCCC 6750

GTGATGACGG TGAAAACCTC 
AAGCGGATGC 6840

AGGGCGCGTC AGCGGGTGTT 
AGATTGTACT 6930

GTGAAATACC GCACAGATGC 
7002 



TGAGTAACCA TGCATCATCA 

CATCTGTAAC ATCATTGGCA 

CACCTGATTG CCCGACATTA 

TTTCCCGTTG AATATGGCTC 

GTGCAATGTA ACATCAGAGA 

GCGGATACAT ATTTGAATGT 

CCATTATTAT CATGACATTA 

TGACACATGC AGCTCCCGGA 

GGCGGGTGTC GGGGCTGGCT 

GTAAGGAGAA AATACCGCAT 